Enhanced input peripheral

ABSTRACT

A mouse having X- and Y-position encoders, and associated circuitry for generating X- and Y-movement data, additionally includes an optical sensor for producing grey-scale image data, thereby allowing the mouse to serve both as a positioning device and an optical input device. Desirably, the substrate on which the optical sensor is formed also includes a steganographic decoder. This decoder can enable a variety of functionality, such as linking to web addresses steganographically encoded in print media (e.g., by subtle inking variations or by texture). In other embodiments, such a peripheral is provided without mouse-like functionality, and again permits reading machine-readable indicia printed in catalogs and the like.

RELATED APPLICATION DATA

This application is a continuation-in-part of copending allowedapplication Ser. No. 09/343,101, filed Jun. 29, 1999, which is acontinuation in part both of application Ser. No. 09/314,648, filed May19, 1999 (now U.S. Pat. No. 6,681,028), and provisional application60/134,782, also filed May 19, 1999 (attached as Appendix A).

This application is also a continuation-in-part of copending applicationSer. No. 09/679,261 (attached as Appendix B), which claims priority fromprovisional application 60/158,015, filed Oct. 6, 1999.

The subject matter of this application is also related to that of theassignee's other patents and applications, as exemplified by U.S. Pat.No. 5,841,978.

FIELD OF THE INVENTION

The present invention relates optical user interfaces that sensedigitally-encoded objects. The invention further relates to systemsusing such optical interfaces to control computers, and to navigate overor act as portals on networks.

BACKGROUND AND SUMMARY OF THE INVENTION

“Bedoop.” That might be the sound that someone might hear as they lazilyplace a magazine advertisement in front of their desktop camera.Magically, the marketing and sales web site associated with the ad isdisplayed on their computer. More information? Want to buy now? Look atthe full product line? No problem.

“Bedoop.” That might be the same sound when that same someone placestheir credit card in front of their desktop camera. Instantly, theproduct displayed on the web page is purchased. Behind the scenes, asecure purchase link is initiated, transmitting all requisiteinformation to the vendor. Twist the credit card clockwise and thepurchaser chooses overnight delivery.

So goes an exemplary embodiment of the invention further described inthis application. Though this example is rather specific, itnevertheless alludes to an indescribably vast array of applicationspossible when a digital camera or other optical sensing device is turnedinto a general purpose user interface device with an intuitive powerthat very well might rival the mouse and the keyboard.

The centerpiece of the invention is that an object or paper productso-scanned contains digital information that can be quickly read andacted upon by an appropriately configured device, computer or appliance.The preferred embodiment envisions that this digital information isaesthetically hidden on objects. These objects have been previously andpro-actively marked with the digital information, using any of the broadranges of printing and processing techniques which are available on themarket and which are widely described in the open literature and patentliterature surrounding digital watermarking.

Be this as it may, though the invention concentrates on flat objectapplications wherein the digital information is often imperceptiblyintegrated into the object, it is certainly not meant to be so limited.Objects can be three dimensional in nature and the information morevisually overt and/or pre-existing (i.e., not “pro-actively” embedded,or not even be “digital,” per se). Different implementationconsiderations attach to these variants. Likewise, though the bulk ofthis disclosure concentrates on objects which have some form of digitalmessage attached thereto, some aspects of the invention may apply toobjects which have no such thing, where the prior arts of patternrecognition and gestural input can be borrowed in combination with thisinvention to effect yet a broader array of applications.

“Bedoop.” The sound that a refrigerator might make, outfitted with asimple camera/processor unit/net connection, as the ten year old holdsup the empty milk carton and a ping goes out to the local grocery store,adding the item to an accumulating delivery list. The sound that mightbe heard echoing over and over inside Internet cafés as heretoforecomputerphobes take their first skeptical steps onto the world wide web.The sound heard at the fast food counter as the repeat customer holds uptheir sandwich card ticking off their latest meal, hoping for the sirensto go off for a $500 prize given to the lucky customer of the week. Bluesky scenarios abound.

This invention is therefore about powerful new user interfaces tocomputers involving optical input. These new user interfaces extend intothe everyday world in ways that a mouse and keyboard never could. Byenabling everyday objects to communicate their identities and functionsto ever-attendant devices, not only will the world wide web be given anentirely new dimension, but basic home and office computing may be instore for some fundamental advances as well.

These and a great many other features of the present invention will bemore readily apparent from the following detailed description, whichproceeds with reference to the accompanying drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a block diagram showing one embodiment of the presentinvention.

FIG. 2 is another block diagram showing an embodiment of the presentinvention.

DETAILED DESCRIPTION

Basically, the technology detailed in this disclosure may be regarded asenhanced systems by which users can interact with computer-baseddevices. Their simple nature, and adaptability for use with everydayobjects (e.g., milk cartons), makes the disclosed technology well suitedfor countless applications.

Due to the great range and variety of subject matter detailed in thisdisclosure, an orderly presentation is difficult to achieve. As will beevident, many of the topical sections presented below are both foundedon, and foundational to, other sections. For want of a better rationale,the sections are presented below in a more or less random order. Itshould be recognized that both the general principles and the particulardetails from each section find application in other sections as well. Toprevent the length of this disclosure from ballooning out of control,the various permutations and combinations of the features of thedifferent sections are not exhaustively detailed. The inventors intendto explicitly teach such combinations/permutations, but practicalityrequires that the detailed synthesis be left to those who ultimatelyimplement systems in accordance with such teachings.

Basic Principles—Refrigerators and Clutter

Referring to FIG. 1, a basic embodiment 10 of the present inventionincludes an optical sensor 12, a computer 14, and a network connection16 to the internet 18. The illustrated optical sensor 12 is a digitalcamera having a resolution of 320 by 200 pixels (color or black andwhite) that stares out, grabbing frames of image data five times persecond and storing same in one or more frame buffers. These frames ofimage data are analyzed by a computer 14 for the presence of Bedoopdata. (Essentially, Bedoop data is any form of digital data encodingrecognized by the system 10—data which, in many embodiments, initiatessome action.) Once detected, the system responds in accordance with thedetected Bedoop data (e.g., by initiating some local action, or bycommunication with a remote computer, such as over the internet, via anonline service such as AOL, or using point-to-point dial-upcommunications, as with a bulletin board system.

Consider the milk carton example. The artwork on a milk carton can beadapted to convey Bedoop data. In the preferred embodiment, the Bedoopdata is steganographically encoded (e.g., digitally watermarked) on thecarton. Numerous digital watermarking techniques are known—all of whichconvey data in a hidden form (i.e., on human inspection, it is notapparent that digitally encoded data is present). Exemplary techniquesoperate by slightly changing the luminance, or contours, of selectedpoints on artwork or text printed on the carton, or splatter tinydroplets of ink on the carton in a seemingly random pattern. Each ofthese techniques has the effect of changing the local luminance at areasacross the carton—luminance changes that can be detected by the computer14 and decoded to extract the encoded digital data. In the case of amilk carton, the data may serve to identify the object as, e.g., a halfgallon carton of Alpenrose brand skim milk.

The FIG. 1 apparatus can be integrated into the door of a refrigeratorand used to compile a shopping list. Milk cartons, and otherBedoop-encoded packaging 20, can be held up the optical sensor. When thecomputer 14 detects the presence of Bedoop data and successfully decodessame, it issues a confirmation tone (“be-doop”) from a speaker or otheraudio transducer 22. The computer then adds data identifying thejust-detected object to a grocery list. This list can be maintainedlocally (in disk storage, non-volatile RAM 24, or the like in therefrigerator, or elsewhere in the home), or remotely (e.g., at a serverlocated at a user-selected grocery, or elsewhere). In either event, thelist is desirably displayed on a display in the user's home (e.g., anLCD screen 26 built into the front of the appliance). Conventional userinterface techniques can be employed permitting the user to scrollthrough the displayed list, delete items as desired, etc.

Periodically, the listed groceries can be purchased and the listcleared. In one embodiment, the list is printed (either at the home orat the grocery), and the user walks the grocery aisles and purchasessame in the conventional manner. In another embodiment, the grocer pullsthe listed items from the shelves (in response to a user requestconveyed by the internet or telephone, or by a gesture as hereafterdetailed). Once the list has been pulled, the grocer can alert the userthat the groceries are available for pickup (again, e.g., by internet ortelephone message), or the grocer can simply deliver the groceriesdirectly to the user's home. Naturally, on-line payment mechanisms canbe employed if desired.

Consider a wholly unrelated Bedoop application. An Excel spreadsheet isprinted onto paper, and the paper becomes buried in a stack of clutteron an office worker's desk. Months later the spreadsheet again becomesrelevant and is dug out of the stack. Changes need to be made to thedata, but the file name has long-since been forgotten. The worker simplyholds the dug-out page in front of a camera associated with the desktopcomputer. A moment later, the electronic version of the file appears onthe worker's computer display.

When the page was originally printed, tiny droplets of ink or toner weredistributed across the paper in a pattern so light as to be essentiallyun-noticeable, but which steganographically encoded the page with aplural-bit binary number (e.g., 64 bits). A database (e.g., maintainedby the operating system, the Excel program, the printer driver, etc.)stored part of this number (e.g., 24 bits, termed a Univeral Identifieror UID) in association with the path and file name at which theelectronic version of the file was stored, the page number within thedocument, and other useful information (e.g., author of the file,creation date, etc.).

The steganographic encoding of the document, and the updating of thedatabase, can be performed by the software application (e.g., Excel).This option can be selected once by the user and applied thereafter toall printed documents (e.g., by a user selection on an “Options”drop-down menu), or can be presented to the user as part of the Printdialog window and selected (or not) for each print job.

When such a printed page is later presented to the camera, the computerautomatically detects the presence of the encoded data on the page,decodes same, consults the database to identify the filename/location/page corresponding to the UID data, and opens theidentified file to the correct page (e.g., after launching Excel). Thisapplication is one of many “paper as portal” applications of the Bedooptechnology.

The foregoing are but two of myriad applications of the technologydetailed herein. In the following discussion a great many otherapplications are disclosed (some groundbreaking, a few gimmicky).However, regardless of the length of the specification, it is possibleonly to begin to explore a few of the vast ramifications of thistechnology.

A few more details on the basic embodiments described above may behelpful before delving into other applications.

Optics

For any system to decode steganographically encoded data from an object,the image of the object must be adequately focused on the digitalcamera's CCD (or other) sensor. In a low cost embodiment, the camera hasa fixed nominal focal length, e.g., in the range of 6-24 inches (greateror lesser lengths can of course be used). Since the camera iscontinuously grabbing and analyzing frames of data, the user can movethe object towards- or away-from the sensor until the decoder succeedsin decoding the steganographically encoded data and issues a confirming“Bedoop” audio signal.

In more elaborate embodiments, known auto-focusing technology can beemployed.

In still other embodiments, the camera (or other sensor) can be equippedwith one or more auxiliary fixed-focus lenses that can be selectivelyused, depending on the particular application. Some such embodimentshave a first fixed focused lens that always overlies the sensor, withwhich one or more auxiliary lenses can be optically cascaded (e.g., byhinge or slide arrangements). Such arrangements are desirable, e.g.,when a camera is not a dedicated Bedoop sensor but also performs otherimaging tasks. When the camera is to be used for Bedoop, the auxiliarylens is positioned (e.g., flipped into) place, changing the focal lengthof the first lens (which may by unsuitably long for Bedoop purposes,such as infinity) to an appropriate Bedoop imaging range (such as onefoot).

Other lens-switching embodiments do not employ a fixed lens that alwaysoverlies the sensor, but instead employ two or more lenses that can bemoved into place over the sensor. By selecting different lenses, focallengths such as infinity, six feet, and one foot can be selected.

In all such arrangements, it is desirable (but not essential) that thesteganographically-encoded portion of the object being imaged fills asubstantial part of the image frame. The object can be of various sizes,e.g., an 10 by 12 inch front panel of a cereal box, or a proof ofpurchase certificate that is just one inch square. To meet thisrequirement, small objects will obviously need to be placed closer tothe camera than large objects. The optics of the system can be designed,e.g., by selection of suitable aperture sizing and auxiliary lighting(if needed), to properly image objects of various sizes within a rangeof focal distances.

Some embodiments avoid issues of focal distances and identifying theintended object by constraining the size of the object and/or itsplacement. An example is a business card reader that is designed for thesole task of imaging business cards. Various such devices are known.

Decoding/Encoding

The analysis of the image data can be accomplished in various knownways. Presently, most steganographic decoding relies on general purposemicroprocessors that are programmed by suitable software instructions toperform the necessary analysis. Other arrangements, such as usingdedicated hardware, reprogrammable gate arrays, or other techniques, canof course be used.

The steganographic decoding process may entail three steps. In thefirst, the object is located. In the second, the object's orientation isdiscerned. In the third, the Bedoop data is extracted from the imagedata corresponding to the Bedoop object.

The first step, object location, can be assisted by various clues. Oneis the placement of the object; typically the center of the image fieldwill be a point on the object. The surrounding data can then be analyzedto try and discern the object's boundaries.

Another location technique is slight movement. Although the user willtypically try to hold the object still, there will usually be somejitter of the Bedoop object within the image frame (e.g., a few pixelsback and forth). Background visual clutter, in contrast, will typicallybe stationary. Such movement may thus be sensed and used to identify theBedoop object from within the image data.

Still another object-location clue is object shape. Many Bedoop objectsare rectangular in shape (or trapezoidal as viewed by the camera).Straight edge boundaries can thus be used to define an area of likelyBedoop data.

Color is a further object identification clue that may be useful in somecontexts. Yet another object location clue is spatial frequency. Inimaging systems with well defined focal zones, undesired visual cluttermay be at focal distances that results in blurring. The Bedoop object,in contrast, will be in focus and may be characterized by fine detail.Analyzing the image data for the high frequencies associated with finedetail can be used to distinguish the intended object from others.

Characteristic markings on the object (as discussed below in connectionwith determining object orientation), can also be sensed and used inlocating the object.

Once the Bedoop object has been located within the image data, maskingcan be applied (if desired) to eliminate image data not corresponding tothe intended object.

The second step in the decoding process-determining orientation of theBedoop data-can likewise be discerned by reference to visual clues. Forexample, some objects include subliminal graticule data, or othercalibration data, steganographically encoded with the Bedoop data to aidin determining orientation. Others can employ overt markings, eitherplaced for that sole purpose (e.g. reference lines or fiducials), orserving another purpose as well (e.g. lines of text), to discernorientation. Edge-detection algorithms can also be employed to deducethe orientation of the object by reference to its edges.

Some embodiments filter the image data at some point in the process toaid in ultimate Bedoop data extraction. One use of such filtering is tomitigate image data artifacts due to the particular optical sensor. Forexample, CCD arrays have regularly-spaced sensors that sample theoptical image at uniformly spaced discrete points. This discretesampling effects a transformation of the image data, leading to certainimage artifacts. An appropriately configured filter can mitigate theeffect of these artifacts.

(In some arrangements, the step of determining the orientation can beomitted. Business card readers, for example, produce data that isreliably free of artifacts and is of known scale. Or the encoding of theBedoop data can be effected in such a way that renders it relativelyimmune to certain distortion mechanisms. For example, while thepresently-preferred encoding arrangement operates on a 2D grid basis,with rows and columns of data points, the encoding can alternatively bedone on another basis (e.g., a rotationally-symmetric form of encoding,such as a 2D bar-code, so that rotational state of the image data can beignored). In still other embodiments, the orientation-determining stepcan be omitted because the decoding can readily proceed without thisinformation. For example decoding which relies on the Fourier-Mellintransform produces data in which scale and rotation can be ignored.)

Once the orientation of the object is discerned, the image data may bevirtually re-registered, effectively mapping it to another perspective(e.g., onto a rectilinear image plane). This mapping can employ knownimage processing techniques to compensate, e.g., for rotation state,scale state, differential scale state, and X-Y offset, of the originalBedoop image data. The resulting frame of data may then be more readilyprocessed to extract the steganographically-encoded Bedoop data.

In the preferred embodiment, after the image data is remapped intorectilinear planar form, subliminal graticule data is sensed thatidentifies the locations within the image data where the binary data isencoded. Desirably, the binary data is redundantly encoded, e.g., in 8×8patch blocks. Each patch comprises one or more pixels. (The patches aretypically square, and thus contain 1, 4, 9, or 16, etc. pixels.) Thenominal luminance of each patch before encoding (e.g., artworkpre-existing on the object) is slightly increased or decreased to encodea binary “1” or “0.” The change is slight enough to be generallyimperceptible to human observers, yet statistically detectable from theimage data—especially if several such blocks are available for analysis.Preferably, the degree of change is adapted to the character of theunderlying image, with relatively greater changes being made in regionswhere the human eye is less likely to notice them. Each block thusencoded can convey 64 bits of data. The encoding of such blocks in tiledfashion across the object permits the data to be conveyed in robustfashion.

Much of the time, of course, the Bedoop sensor is staring out andgrabbing image frames that have no Bedoop data. Desirably, the detectionprocess includes one or more checks to assure that Bedoop data is notwrongly discerned from non-Bedoop image data. Various techniques can beemployed to validate the decoded data, e.g., error detecting codes canbe included in the Bedoop payload and checked to confirm correspondencewith the other Bedoop payload. Likewise, the system can confirm that thesame Bedoop data is present in different tiled excerpts within the imagedata, etc.

(Details of the preferred encoding techniques are further detailed inapplication Ser. No. 09/293,601, filed Apr. 15, 1999, entitled METHODSAND DEVICES FOR RECOGNIZING BANKNOTES AND RESPONDING ACCORDINGLY (nowU.S. Pat. No. 6,427,020), Ser. No. 09/127,502, filed Jul. 31, 1998 (nowU.S. Pat. No. 6,345,104), and U.S. Pat. No. 5,862,260.)

Data Structures, Formats, Protocols, and Infrastructures

In an exemplary system, the Bedoop data payload is 64 bits. This payloadis divided into three fields CLASS (12 bits), DNS (24 bits) and UID (24bits). (Other payload lengths, fields, and divisions, are of coursepossible, as is the provision of error-checking or error-correctingbits.)

Within the above-described eight patch-by-eight patch data block, thebits are ordered row by row, starting with the upper left patch. Thefirst 12 bits are the CLASS ID, followed by 24 bits of DNS data followedby 24 bits of UID data. (In other embodiments, the placement of bitscomprising these three fields can be scrambled throughout the block.)

Briefly, the CLASS ID is the most rudimentary division of Bedoop data,and may be analogized, in the familiar internet taxonomy, to the limitednumber of top level domains (e.g., .com, .net, .org, .mil, .edu, .jp,.de, .uk, etc.). It is basically an indicator of object type. The DNS IDis an intermediate level of data, and may be analogized to internetserver addresses (e.g., biz.yahoo, interactive.wsj, etc.) The UID is thefinest level of granularity, and can roughly be analogized to internetpages on a particular server (e.g., edition/current/summaries/front.htm,daily/home/default.hts, etc.).

Generally speaking, the CLASS ID and DNS ID, collectively, indicate tothe system what sort of Bedoop data is on the object. In the case ofBedoop systems that rely on remote servers, the CLASS and DNS IDs areused in identifying the server computer that will respond to the Bedoopdata. The UID determines precisely what response should be provided.

In the case of a refrigerator Bedoop system, what happens if an objectwith an unfamiliar CLASS/DNS ID data is encountered? The system can beprogrammed not to respond at all, or to respond with a raspberry-likesound (or other feedback) indicating “I see a Bedoop object but don'tknow what to do with it.”

Most systems will be able to respond to several classes of Bedoopobjects. Simple software-based systems can compare the CLASS/DNS ID (andoptionally the UID) to fixed values, and can branch program execution tocorresponding subroutines. Likewise, hardware-based systems can activatedifferent circuitry depending on the detected CLASS/DNS ID.

In the case of a computer equipped with a Bedoop input device (e.g., aSony VAIO PictureBook laptop with built-in camera), the operatingsystem's registry database can be employed to associate differentapplication programs with different CLASS/DNS IDs (just as the .XLS and.DOC file extensions are commonly associated by existing operatingsystem registries to invoke Microsoft Excel and Word softwareapplications, respectively). When a new Bedoop application is installed,it logs an entry in the registry database indicating the CLASS/DNS ID(s)that it will handle. Thereafter, when an object with such a CLASS/DNS IDis encountered, the operating system automatically launches thecorresponding application to service the Bedoop data in an appropriatemanner.

Sometimes the computer system may encounter a Bedoop object for which itdoes not have a registered application program. In such case, a defaultBedoop application can be invoked. This default application can, e.g.,establish an internet link to a remote server computer (or a network ofsuch computers), and can transmit the Bedoop data (or a part of theBedoop data) to that remote computer. The remote server can undertakethe response itself, it can instruct the originating computer how torespond appropriately, or it can undertake some combination of these tworesponses. (Such arrangements are further considered below.)

FIG. 2 shows an illustrative architecture employing the foregoingarrangement.

At a local Bedoop system 28 (which may be implemented, for example,using a conventional personal computer 29), a camera, scanner, or otheroptical sensor 30 provides image data to a decoder 32 (which may beimplemented as a software component of the operating system 33). Thedecoder 32 analyzes the image data to discern the plural-bit Bedoopdata. The CLASS ID of this Bedoop data is applied to a Bedoop registry34. The registry responds by identifying and launching a local Bedoopapplication 36 designed to service the discerned Bedoop data.

Sometimes the system 28 may encounter a Bedoop object for which severaldifferent responses may be appropriate. In the case of a printed officedocument, for example, one response may be as described above—to presentthe electronic version of the file on a computer, ready for editing. Butother responses may also be desired, such as writing an email message tothe author of the printed document, with the author's email addressalready specified in the message address field, etc.

Such different responses may be handled by different Bedoopapplications, or may be options that are both provided by a singleBedoop application. In the former case, when the CLASS/DNS IDs aredecoded and provided to the operating system, the registry indicatesthat there are two (or more) programs that might be invoked. Theoperating system can then present a dialog box to the user inviting theuser to specify which form of response is desired. Optionally, a defaultchoice can be made if the user doesn't specify within a brief period(e.g., three seconds). The operating system can then launch the Bedoopapplication corresponding to the chosen response.

A similar arrangement can be employed if a single Bedoop application canprovide both responses. In such case the operating system launches thesingle Bedoop application (since there is no ambiguity to be resolved),and the application presents the choice to the user. Again, the user canselect, or a default choice can be automatically made.

In the just-described situations, the user can effect the choice byusing the keyboard or mouse—as with traditional dialog boxes. But Bedoopprovides another, usually easier, form of interaction. The user can makethe selection through the optical sensor input. For example, moving theobject to the right can cause a UI button on the right side of thedialog box to be selected; moving the object to the left can cause a UIbutton on the left side of the dialog box to be selected; moving theobject towards the camera can cause the selected button to be activated.Many other such techniques are possible, as discussed below.

If the registry 34 does not recognize, or otherwise does not know how torespond to Bedoop data of that particular CLASS/DNS, the registrylaunches a default Bedoop client application. This client application,in turn, directs a web browser 40 on the local Bedoop system 28 tocommunicate with a remote master registration server computer 42. Thelocal computer forwards the Bedoop data to this master server. Themaster server 42 examines the CLASS ID, and forwards the Bedoop data(directly, or through intervening servers) to a corresponding CLASSserver 44. (A single server may handle Bedoop data of several classes,but more typically there is a dedicated server for each CLASS.)

Each CLASS server 44 serves as the root of a tree 46 of distributed DNSservers. A DNS server 48 a, for example, in a first tier 50 of the DNSserver tree, may handle Bedoop data having DNS IDs beginning with “000.”Likewise, DNS server 48 b may handle Bedoop data having DNS IDsbeginning with “001,” etc., etc.

Each DNS server in the first tier 50 may, in turn, route Bedoop data toone of 8 servers in a second tier of the tree, in accordance with thefourth-through sixth bits of the DNS data. The tree continues in thisfashion until a terminal level of DNS leaf node servers 56.

Ultimately, Bedoop data routed into this network reaches a DNS leaf nodeserver 56. That leaf node server may handle the Bedoop data, or mayredirect the local Bedoop system to a further server 58 that does so.That ultimate server—whether a DNS leaf node server or a furtherserver—can query the local Bedoop system for further information, ifnecessary, and can either instruct the local Bedoop system how torespond, or can undertake some or all of the response itself and simplyrelay appropriate data back to the local Bedoop system.

In arrangements in which the local Bedoop system is redirected, by theDNS leaf node server, to a further server that actually handles theresponse, access to the further server may be through a port 59 (e.g., aspecial URL) tailored to receipt of Bedoop data.

In a typical implementation, most or all of the servers are mirrored, orotherwise replicated/redundant, so that failure of individual computersdoes not impair operation of the system.

Caching can be provided throughout the trees of servers to speedresponses. That is, responses by leaf nodes for certainlycommonly-encountered CLASS/DNS IDs can be temporarily stored earlier inthe tree(s). Bedoop data, propagating through the server network, canprompt a response from an intermediate server if there is a cache hit.

If desired, Bedoop traffic through the above-detailed server trees canbe monitored to collect demographic and statistical information as towhat systems are sending what Bedoop data, etc. One use of suchinformation is to dynamically reconfigure the DNS network to betterbalance server loads, to virtually relocate DNS resources nearer regionsof heavy usage, etc. Another use of such information is for marketingpurposes, e.g., to promote certain Bedoop features and applicationswithin user groups (e.g., internet domains) that seem to under-utilizethose features.

Within certain user networks that are linked to the internet, e.g.,corporate networks, Bedoop data that isn't handled within theoriginating Bedoop system may first be routed to a Bedoop name serverwithin the corporate network. That server will recognize certain typesof Bedoop data, and know of resources within the corporate networksuitable for handling same. Referral to such resources within thecorporate network will be made, where possible. These resources (e.g.,corporate servers) may respond to Bedoop data in a way customized to thecorporate preferences. If the corporate Bedoop name server does not knowof a resource within the corporate network that can respond to theBedoop data, the corporate name server then routes the data to thepublic Bedoop network described above. (Such referral can be to themaster registration server or, to the extent the corporate name serverknows the addresses of appropriate servers within the DNS server tree,or of the further servers to which DNS servers may point for certainBedoop data, it can redirect the local Bedoop system accordingly.)

In typical rich Bedoop implementations, local systems may have librariesof Bedoop services, applications, or protocols. Some may be unique tothat computer. Others may be commonly available on all computers. Somemay be highly secure, employing encryption and/or anti-hacking measures,or data protocols that are not generally recognized. Others may beshareware, or the result of open-source programming efforts.

Greeting Cards, Birthday Cards, Etc.

In accordance with a further embodiment of the invention, greeting cardsand the like are encoded (e.g., by texturing, printing, etc.) withBedoop data. On receiving such a card, a recipient holds it in front ofthe image capture device on a laptop or other computer. The computerresponds by displaying an internet web page that has a stock- orcustomized-presentation (image, video, audio-video, etc.) to complementthat presented on the greeting card.

The web site presentation can be personalized by the sender (e.g., witha text message, recent family photographs, etc.), either at the point ofcard sale, or sometime after the card is purchased. In the latter case,for example, the card can be serialized. After taking the card home, thepurchaser can visit the card vendor's web site and enter the card serialnumber in an appropriate user interface. The purchaser is then presentedwith a variety of simple editing tools to facilitate customization ofthe web greeting. When the sender is finished designing the webgreeting, the finished web page data is stored (by software at thevendor's web site) at a site corresponding to the serial number.

When the card is received by a recipient and held in front of a Bedoopsensor, CLASS, DNS, and UID data is decoded from the card. The CLASS andDNS data are used to navigate the earlier-described server network toreach a corresponding DNS leaf node server (perhaps maintained by theHallmark greeting card company). That leaf node server indexes a table,database, or other data structure with the UID from the Bedoop data, andobtains from that data structure the address of an ultimate web site—thesame address at which the web greeting customized by the sender wasstored. That address is provided by the DNS leaf node server back to thelocal computer, with instructions that the web page at that address beloaded and displayed (e.g., by HTML redirection). The local computercomplies, presenting the customized web greeting to the card recipient.

In the just-described embodiment, in which a pre-encoded card ispurchased by a sender and the web-display is then customized, theaddress of the web site is typically determined by the card vendor. Butthis need not be the case. Likewise, the card need not be “purchased” inthe typical, card-shop fashion.

To illustrate the foregoing alternatives, consider the on-lineacquisition of a greeting card, e.g., by visiting a web sitespecializing in greeting cards. With suitable user-selection (and,optionally, customization), the desired card can be printed using anink-jet or other printer at the sender's home. In such case, the Bedoopdata on the card can be similarly customized. Instead of leading to asite determined by the card vendor, the data can lead to the sender'spersonal web page, or to another arbitrary web address.

To effect such an arrangement, the sender must arrange for a DNS leafnode server to respond to a particular set of Bedoop data by pointing tothe desired web page. While individuals typically will not own DNSservers, internet service providers commonly will. Just as AOL providessimple tools permitting its subscribers to manage their own modest webpages, internet service providers can likewise provide simple toolspermitting subscribers to make use of DNS leaf node servers. Eachsubscriber may be assigned up to 20 UIDs (under a particular CLASS andDNS). The tools would permit the users to define a corresponding webaddress for each UID. Whenever a Bedoop application led to that DNS leafnode server, and presented one of those UIDs, the server would instructthe originating computer to load and present the web page at thecorresponding web address 58.

Prior to customizing the greeting card, the sender uses the toolprovided by the internet service provider to store the address of adesired destination web address in correspondence with one of thesender's available UIDs. When customizing the greeting card, the senderspecifies the Bedoop data that is to be encoded, including thejust-referenced UID. The greeting card application encodes this datainto the artwork and prints the resulting card. When this card is laterpresented to a Bedoop system by the recipient, the recipient's systemloads and displays the web page specified by the sender.

Commerce in Bedoop Resources

In the just-described arrangement, internet service providers makeavailable to each subscriber a limited number of UIDs on a DNS servermaintained by the service. Business enterprises typically need greaterBedoop resources, such as their own DNS IDs (or even their own CLASSID(s).

While variants of the Bedoop system are extensible to provide anessentially unlimited number of CLASS IDs and DNS IDs, in theillustrated system these resources are limited. Public service,non-profit, and academic applications should have relatively generousaccess to Bedoop resources, either without charge or for only a modestcharge. Business enterprises, in contrast, would be expected to pay feesto moderate their potentially insatiable demand for the resources. Smallbusinesses could lease blocks of UIDs under a given CLASS/DNS ID. Largerbusinesses could acquire rights to entire DNS IDs, or to entire CLASSIDs (at commensurately greater fees).

Web-based systems for assigning DNS IDs (and CLASS IDs) can be modeledafter those successfully used by Internic.com, and nowNetworksolutions.com, for registration of internet domains. The userfills out a web-based form with names, addresses, and billinginformation; the system makes the necessary changes to all of the hiddensystem infrastructure—updating databases, routing tables, etc., inservers around the world.

Controlled-Access ID

Just as the above-described embodiment employed an ink-jet printer toproduce a customized-Bedoop greeting card, the same principles canlikewise be applied to access-control objects, such as photo-IDs.

Consider an employment candidate who will be interviewing at a newemployer. The candidate's visit is expected, but she is not recognizedby the building's security personnel. In this, and many otherapplications, arrangements like the following can be used:

The employer e-mails or otherwise sends the candidate an access code.(The code can be encrypted for transmission.) The code is valid only fora certain time period on a given date (e.g., 9:00 a.m.-11:00 a.m. onJun. 29, 1999).

Upon receipt of the access code, the candidate downloads from the website of the state Department of Motor Vehicles the-latest copy of herdriver's license photo. The DMV has already encoded this photo withBedoop data. This data leads to a state-run DNS leaf node server 56.When that server is presented with a UID decoded from a photograph, theserver accesses a database and returns to the inquiring computer a textstring indicating the name of the person depicted by the photograph.

The candidate incorporates this photo into an access badge. Using asoftware application (which may be provided especially for suchpurposes, e.g., as part of an office productivity suite), the photo isdragged into an access badge template. The access code emailed from theemployer is also provided to this application. On selecting “Print,” anink-jet printer associated with the candidate's computer prints out anaccess badge that includes her DMV photo and her name, and is alsosteganographically encoded in accordance with the employer-providedaccess code.

The name printed on the badge is obtained (by the candidate's computer)from the DMV's DNS server, in response to Bedoop data extracted from thephotograph. (In this application, unlike most, the photograph is notscanned as part of a Bedoop process. Instead, the photograph is alreadyavailable in digital form, so the Bedoop decoding proceeds directly fromthe digital representation.)

For security purposes, the access code is not embedded using standardBedoop techniques. Instead, a non-standard format (typicallysteganographic) is employed. The embedding of this access code can spanthe entire face of the card, or can be limited to certain regions (e.g.,excluding the region occupied by the photograph).

On the appointed day the candidate presents herself at the employer'sbuilding. At the exterior door lock, the candidate presents the badge toan optical sensor device, which reads the embedded building access code,checks it for authenticity and, if the candidate arrived within thepermitted hours, unlocks the door.

Inside the building the candidate may encounter a security guard. Seeingan unfamiliar person, the guard may visually compare the photo on thebadge with the candidate's face. Additionally, the guard can present thebadge to a portable Bedoop device, or to one of many Bedoop systemsscattered through the building (e.g., at every telephone). The Bedoopsystem extracts the Bedoop data from the card (i.e., from the DMVphotograph), interrogates the DMV's DNS server with this Bedoop data,and receives in reply the name of the person depicted in the photograph.(If the Bedoop system is a telephone, the name may be displayed on asmall LCD display commonly provided on telephones.)

The guard checks the name returned by the Bedoop system with the nameprinted on the badge. On seeing that the printed and Bedoop-decodednames match (and optionally checking the door log to see that a personof that name was authorized to enter and did so), the security guard canlet the candidate pass.

It will be recognized that the just-described arrangement offers veryhigh security, yet this security is achieved without the candidate everpreviously visiting the employer, without the employer knowing what thecandidate looks like, and by use of an access badge produced by thecandidate herself.

Variants of such home-printed badge embodiments find numerousapplications. Consider purchasing movie- or event-tickets over the web.The user can print an access ticket that has an entry code embeddedtherein. On arriving at the theater or event, the user presents theticket to an optical scanning device, which decodes the entry code,checks the validity of same, authorizes the entry, and marks that entrycode as having been used (preventing multiple uses of tickets printedwith the same code).

Ink-Jet Printing

In the foregoing discussions, reference has been made to use of ink-jetprinting as a means for providing steganographically encoded indicia onsubstrates. The following discussion expands on some of the operativeprinciples.

The basic physics and very low level analog electronic operation ofink-jet printers (sometimes termed bubble-jet printers) are ideallysuited to support very-light-tint background digital watermarking on anyform of substrate. (Watermarking through apparent “tinting” ofsubstrates is discussed in application Ser. No. 09/127,502, now U.S.Pat. No. 6,345,104.) In general, the statement, “if you can print itwith an ink jet printer, you can watermark it” is largely accurate, evenfor (perhaps especially for) simple text documents. Indeed, there is adegree of flexibility and control in the ink-jet printing realm that isnot as generally available in more traditional printing technologies,such as commercial offset printing and other plate-based technologies.(This is not to say that ink-jet has better quality than plate-basedtechnologies; it has more to do with the statistics of ink droplets thananything else.) Heavier tint backgrounds are possible as well, where thecontinuum ranges from very light background tinting, where the casualobserver will see “white paper,” all the way through heavily inkedpatterned backgrounds, and photographs themselves, and everything inbetween.

In some embodiments, the ink-jet driver software is modified to providelower-level control of individual droplet emission than is provided inexisting printer drivers, which are naturally optimized for text andgraphics. In some such embodiments, the “watermarking” print mode isanother option from which the user can select (e.g., in addition to HighQuality, Econo-Fast, etc.), or the selection can be made automaticallyby application software that is printing watermarked data.

In more sophisticated embodiments, the watermark data is applied to theprinter driver software independently of the other image/text data. Theprinter driver is arranged to eject droplets in the usual print densityfor the image/text data, and at a more accurately-controlled, finerdensity for the separately-applied watermark data. (The latter may beeffected as a slight modulation signal on the former.) This arrangementprovides for essentially transparent integration into existing printerenvironments—no one need worry about the watermarking capability exceptthe software applications that specifically make use of same.

Consumer Marking of Web-Based Materials

Various items of printed media can originate off the web, yet be printedat home. Examples include movie tickets, coupons, car brochures, etc.Bedoop data can be added, or modified, by the software application or bythe printer driver at the time of printing. (Alternatively, the Bedoopdata can be customized to correspond to the user before being downloadedto the user's system for printing.)

One advantage to Bedoop-encoding printed images locally, as opposed toBedoop-encoding the image files prior to downloading for local printing,is that the encoding can be tailored in accordance with the particularproperties of the local printer (e.g., to increase robustness ordecrease visibility)—properties not generally known to a remote server.

In one particular example, the UID field in the Bedoop data can bewritten with a value that serves as an index to a database of userprofiles, permitting later systems to which the printed item ispresented to personalize their response in accordance with the profiledata.

In another example, the UID field serves an authentication purpose,e.g., to verify that the printed medium actually was printed at aparticular place, or by a particular user or at a particular time.

Coffee Mug

At retail coffee outlets, customers commonly order the same drink dayafter day (“half-decaf, short, skinny latte”). Some customers presentpersonal coffee mugs to the cashier, preferring the sensation of ceramicor metal to paper, and avoiding the trash/recycle dilemma.

The drinker's “regular” order can be Bedoop-encoded either on the mugitself or, more commonly, on an adhesive label applied to the mug. Theencoding can be in addition to other aesthetic imagery (e.g., artwork ora photo), or the marking can be purely data. Labels the size of postagestamps may be used.

On handing the mug to the cashier, the customer can simply say “theregular.” The cashier passes the mug in front of the optical scanningdevice of a Bedoop system associated with the cash register. The systemsteganographically decodes the data and provides the corresponding order(“half-decaf, short, skinny latte”), either textually or audibly (e.g.,by a voice synthesizer) to the cashier or the barrista. The cashregister system also knows the current price of the requested drink, andrings up the charge accordingly.

Labels of the type described can be available to the cashier onpre-printed rolls, just as with other adhesive stickers, or can beprinted on-demand. (Small label printers may be best suited in thelatter case, given space constraints in retail outlets.) Customersordering drinks for personal mugs may be invited to take a labelcorresponding to their just-ordered drink and apply it to their mug forfuture use.

In variants on this basic theme, the mug label can be further encoded(or a supplemental label can be provided and encoded) with electronicpayment information, such as the customer's credit card number, or thenumber of a debit account maintained by the coffee merchant for thatcustomer. When the mug is scanned for the drink order, the systemlikewise detects the payment information and charges the correspondingfee to the appropriate account. (For security reasons, the system may bearranged so that the mug cannot be used to authorize more than, say $5of coffee drink purchases per day.) In another variant on this theme,the system maintains an electronic log of coffee purchases made by thecustomer and, in accordance with then-prevailing marketingconsiderations, rewards the customer with a free drink after 8 or 12,etc., drinks have been purchased.

In still another variant on this theme, regular customers who useBedoop-labeled mugs can participate in periodic promotions in which, forexample, every N^(th) such customer is rewarded with a cash ormerchandise prize. Bells go off when the Nth mug is scanned. (N can be afixed number, such as 500, or can be a random number—typically within aknown range or with a known mean.)

Smart Elevators

In accordance with another embodiment of the invention, a buildingelevator is provided with one or more optical capture devices. Eachdevice examines monitors the contents of the elevator chamber lookingfor Bedoop encoded objects, such as ID badges.

On sensing a Bedoop-encoded object, the elevator can determine—amongother data—the floor on which the wearer's office is located. The systemcan then automatically direct the elevator to that floor, without theneed for the person to operate any buttons. (The elevator's button panelcan be provided with a new, override button that can be operated toun-select the most recently selected floor(s), e.g., in case a userwants to travel to a different floor.) To aid in identification, theBedoop objects (e.g., badges) can be colored a distinctive color,permitting the system to more easily identify candidate objects fromother items within the optical capture devices' field of view. Or theobject can be provided with a retro-reflective coating, and the elevatorcan be equipped with one or more illumination sources of known spectralor temporal quality (e.g., constant infra red, or constant illuminationwith a single- or multi-line spectrum, or a pulsed light source of knownperiodicity; LEDs or semiconductor lasers, each with an associateddiffuser, can be used for each the foregoing and can be paired with theimage capture devices). Other such tell-tale clues can likewise be usedto aid in object location. In all such cases, the optical capture devicecan sense the tell-tale clue(s) using a wide field of view sensor. Thedevice can then be physically or electronically steered, and/or zoomed,to acquire a higher resolution image of the digitally-encoded objectsuitable for decoding.

Magazines

Magazine (and newspaper) pages can be steganographically encoded withBedoop data to provide another “paper as portal” experience. As with theearlier described office document case, the encoded data yields anaddress to a computer location (e.g., a web page) having the same, orrelated, content.

In one exemplary embodiment, the blank magazine page stock isBedoop-encoded prior to printing. The watermarking can be performed byhigh speed ink-jet devices, which splatter a fine pattern of essentiallyimperceptible ink droplets across each page. Each page can bedifferently watermarked so that, on decoding, page 21 of a magazine canbe distinguished from page 22 of the same magazine (and page 106 of theJun. 21, 1999, issue can be distinguished from page 106 of the Jun. 28,1999, issue). If desired, each page can be further segregated intoregions—either in accordance with the actual boundaries of articles thatwill later be printed on the pages, or in a grid pattern, e.g., of 3columns across by 5 rows high. Each region conveys a distinct Bedoopcode, permitting different portions of the page to lead to different webdata.)

After watermarking and printing, the pages thus produced are bound inthe usual fashion with others to form the finished magazine. (Not allpages in the magazine need to be watermarked.)

Of course, the watermarking can be effected by processes other thanink-jet printing. For example, texturing by pressure rollers is anotheroption well suited for the large volumes of paper to be processed.

On presenting a magazine to the optical scanner device of aBedoop-compliant computer, the computer senses the Bedoop data, decodessame, and launches a web browser to an internet address corresponding tothe Bedoop data. If the magazine page is an advertisement, the internetaddress can provide information complementary to the advertisement. Forexample, if the magazine page is an advertisement for a grocery item,the Bedoop data can identify a web page on which recipes using theadvertised item are presented. If the magazine page includes a photo ofa tropical beach, the Bedoop data can lead to a travel web page (e.g.,hosted by Expedia or other travel service) that presents fare andlodging information useful to a reader who wants to vacation at theillustrated beach. (The fare information can be customized to thereader's home airport by reference to user profile data stored on theuser's computer and relayed to the web site to permit customization ofthe displayed page.)

The data to which the Bedoop data leads needn't be static; it can beupdated on a weekly, daily, or other basis. Thus, if a months-oldmagazine page is presented to a Bedoop device, the resultant data can beup-to-the-minute.

In the case of advertising, the inclusion of Bedoop data increases thevalue of the ad to the advertiser, and so merits a higher charge to theadvertiser from the magazine publisher. This higher charge may be sharedwith the enterprise(s) that provides the Bedoop technology andinfrastructure through which the higher value is achieved.

Business Card Applications

Conventional business cards can be steganographically encoded withBedoop data, e.g., by texturing, watermark tinting, ink-jet splattering,text steganography, etc. As with many of the earlier-describedembodiments, the steganographic encoding is tailored to facilitatedecoding in the presence of arbitrary rotation or scale distortion ofthe card introduced during scanning. (Some such techniques are shown,e.g., in applicant's related patents identified above. Various othertechniques are known to artisans.)

When a recipient of a business card holds it in front of a Bedoopsensor, the operating system on the local system launches a local Bedoopapplication. That local Bedoop application, in turn, establishes anexternal internet connection to a remote business card server. Theaddress of that server may already be known to the local Bedoopapplication (e.g., having been stored from previous use), or the localBedoop system can traverse the above-described public network of DNSservers to reach the business card server.

A database on the business card name server maintains a large collectionof business card data, one database record per UID. When that serverreceives Bedoop data from a local Bedoop system, it parses out the UIDand accesses the corresponding database record. This record typicallyincludes more information than is commonly printed on conventionalbusiness cards. Sample fields from the record may include, for example,name, title, office phone, office fax, home phone, home fax, cellularphone, email address, company name, corporate web page address, personalweb page address, secretary's name, spouse's name, and birthday. Thisrecord is transmitted back to the originating Bedoop system.

The local Bedoop system now has the data, but needs further instructionfrom the user as to how it should be processed. Should a telephonenumber be dialed? Should the information be entered into a personalcontact manager database (e.g., Outlook) on the local system? Etc.

In an exemplary embodiment, the local system presents the availablechoices to the user, e.g., by textual prompts, synthesized voice, etc.The user responds by manipulating the business card in a manner promptedby the system (e.g., move down to telephone at office; move up totelephone at home; move right to access corporate web page; move left toaccess personal web page; rotate left to enter certain elements from thedatabase record (filtered in accordance with a template) into personalcontact manager database, etc. The local Bedoop system respondsaccordingly.

Some card givers may choose to make additional information available tocard recipients—information beyond that known in prior artcontact-management software applications. For example, one of thechoices presented by a local Bedoop system in response to presentationof a business card may be to review the card-giver's personal calendar.(The card-giver can maintain his or her personal calendar on aweb-accessible computer.) By such arrangement, the card-recipient canlearn when the card-giver may be found in the office, when appointmentsmight be scheduled, etc., etc.

Typically, access to this web-calendar is not available to casual webbrowsers, but is accessible only in response to Bedoop data (which maythus be regarded as a form of authentication or password data).

Some users may carry several differently-encoded cards, each with adifferent level of access authorization (e.g., with different UIDs).Thus, some cards may access a biographical page without any calendarinformation, other cards may access the same or different page withaccess enabled to today's calendar, or this week's calendar, only, andstill other cards (e.g., the “spouse” card) may access the same ordifferent page with access enabled for the card-giver's completecalendar. The user can distribute these different cards to differentpersons in accordance with the amount of personal information desired tobe shared with each.

In accordance with a related embodiment, the database recordcorresponding to Bedoop business card data can include a “now” telephonenumber field. This field can be continually-updated throughout the daywith the then-most-suitable communications channel to the card-giver.When the card-giver leaves home to go to the office, or leaves theoffice for a trip in the car, or works a week at a corporate office inanother town, etc., this data field can be updated accordingly. (Apocket GPS receiver, with a wireless uplink, can be carried by theperson to aid in switching the “now” number among various knownpossibilities depending on the person's instantaneous position.) Whenthis database record is polled for the “now” number, it provides thethen-current information.

Consider a Bedoop-enabled public telephone. To dial the phone, abusiness card is held in front of the Bedoop sensor (or slid through anoptical scanner track). The phone interrogates the database at thebusiness card server for the “now” number and dials that number.

To update the any of the fields stored in the database record, the cardgiver can use a special card that provides write-authorizationprivileges. This special card can be a specially encoded version of thebusiness card, or can be another object unique to the card-giver (e.g.,the card-giver's driver's license).

The reference to business cards and personal calendars is illustrativeonly. Going back a century, “calling cards” were used by persons whoseinterests were strictly social, rather than business. The just-discussedprinciples can be similarly applied. Teenagers can carry small cards toexchange with new acquaintances to grant access to private dossiers ofpersonal information, favorite music, artwork, video clips, etc. Thecards can be decorated with art or other indicia that can serve purposeswholly unrelated to the Bedoop data steganographically encoded therein.

Gestural Input

A Bedoop system can determine the scale state, rotation state, X-Yoffset, and differential scale state, of an object by reference toembedded calibration data, or other techniques. If the scan deviceoperates at a suitably high frame rate (e.g., five or ten frames persecond), change(s) in any or all of these four variables can be trackedover time, and can serve as additional input.

In an earlier-discussed example, moving an object to the left or rightin front of the Bedoop scanner caused a left- or right-positioned buttonin a dialog box to be selected. This is a change in the X-Y offset ofthe scanned object. In that earlier example, moving the object inwardlytowards the camera caused the selected button to be activated. This is achange in the scale state of the scanned object.

In similar fashion, twisting the object to the left or right can promptone of two further responses in a suitably programmed Bedoopapplication. (This is a change in the rotation state.) Likewise, tiltingthe object so that one part is moved towards or away from the camera canprompt one of two further responses in the application. (This is achange in the differential scale state.)

In the business card case just-discussed, for example, the card can beheld in front of the Bedoop scanner of a computer. If the card istwisted to the left, the computer opens a web browser to a web pageaddress corresponding to Bedoop data on the card. If the card is twistedto the right, the computer opens an e-mail template, pre-addressed to ane-mail address indicated by the card.

In other examples, twisting an object to move the right edge towards thescanner can be used to effect a right mouse click input, and twistingthe object to move the right edge away from the scanner can be used toeffect a left mouse click input.

Simultaneous changes in two of these four positioning variables can beused to provide one of four different inputs to the computer (e.g., (a)twisting left while moving in; (b) twisting left while moving out; (c)twisting right while moving in; and (d) twisting right while movingout). Simultaneous changes to three or all four-of these variables cansimilarly be used to provide one of eight or sixteen different inputs tothe computer.

Simultaneous manipulations of the object in two or more of these modesis generally unwieldy, and loses the simple, intuitive, feel thatcharacterizes manipulation of the object in one mode. However, a similareffect can be achieved by sequential, rather than simultaneous,manipulation of the card in different modes (e.g., twist left, then movein). Moreover, sequential manipulations permit the same mode to be usedtwice in succession (e.g., move in, then move out). By such sequentialmanipulations of the object, arbitrarily complex input can be conveyedto the Bedoop system.

(It will be recognized that a digitally-encoded object is not necessaryto the gestural-input applications described above. Any object(talisman) that can be distinguished in the image data can bemanipulated by a user in the manners described above, and an appropriatesystem can recognize the movement of the object and respond accordingly.The provision of digital data on the object provides a further dimensionof functionality (e.g., permitting the same gesture to mean differentthings, depending on the digital encoding of the object beingmanipulated), but this is not essential.

Moreover, even within the realm of digitally-encoded gestural talismans,steganographic encoding is not essential. Any other known form ofoptically-recognizable digital encoding (e.g., 1D and 2D bar codes,etc.) can readily be employed.

In an illustrative embodiment, a business card or photograph is used asthe talisman, but the range of possible talismans is essentiallyunlimited.

Gestural Decoding Module

There are various ways in which the Bedoop system's decoding of gesturalinput can be effected. In some Bedoop systems, this functionality isprovided as part of the Bedoop applications. Generally, however, theapplications must be provided with the raw frame data in order todiscern the gestural movements. Since this functionality is typicallyutilized by many Bedoop applications, it is generally preferable toprovide a single set of gestural interpretation software functions(commonly at the operating system level) to analyze the frame data, andmake available gestural output data in standardized form to all Bedoopapplications.

In one such system, a gestural decoding module tracks the encoded objectwithin the series of image data frames, and outputs various parameterscharacterizing the object's position and manipulation over time. Two ofthese parameters indicate the X-Y position of the object within currentframe of image data. The module can identify a reference point (orseveral) on the object, and output two corresponding position data (Xand Y). The first represents the horizontal offset of the referencepoint from the center of the image frame, represented as a percentage offrame width. A two's complement representation, or other representationcapable of expressing both positive and negative values, can be used sothat this parameter has a positive value if the reference point is rightof center-frame, and has a negative value if the reference point is leftof center frame. The second parameter, Y, similarly characterizes theposition of the reference point above or below center-frame (withabove-being represented by a positive value). Each of these twoparameters can be expressed as a seven-bit byte. A new pair of X, Yparameters is output from the gestural decoding module each time a newframe of image data is processed.

In many applications, the absolute X-Y position of the object is notimportant. Rather, it is the movement of the object in X and Y fromframe-to-frame that controls some aspect of the system's response. TheBedoop application can monitor the change in the two above-describedparameters, frame to frame, to discern such movement. More commonly,however, the gestural decoding module performs this function and outputstwo further parameters, X′ and Y′. The former indicates the movement ofthe reference point in right/left directions since the last image frame,as a percentage of the full-frame width. Again, this parameter isrepresented in two's complement form, with positive values representingmovement in the rightward direction, and negative values representingmovement in the leftward direction. The later parameter similarlyindicates the movement of the reference point in up/down directionssince the last frame.

The scale, differential scale, and rotation states of the object can besimilarly analyzed and represented by parameters output from thegestural decoding module.

Scale state can be discerned by reference to two (or more) referencepoints on the object (e.g., diagonal corners of a card). The distancebetween the two points (or the area circumscribed by three or morepoints) is discerned, and expressed as a percentage of the diagonal sizeof the image frame (or its area). A single output parameter, A, whichmay be a seven-bit binary representation, is output.

As with X-Y data, the gestural decoding module can likewise monitorchanges in the scale state parameter since the last frame, and product acorresponding output parameter A′. This parameter can be expressed intwo's complement form, with positive values indicating movement of theobject towards the sensor since the last frame, and negative valuesindicating movement away.

A differential scale parameter, B, can be discerned by reference to fourreference points on the object (e.g., center points on the four edges ofa card). The two points on the side edges of the card define ahorizontal line; the two points on the top and bottom edges of the carddefine a vertical line. The ratio of the two line lengths is a measureof differential scale. This ratio can be expressed as the shorter line'slength as a percentage of the longer line's length (i.e., the ratio isalways between zero and one). Again, a two's complement seven-bitrepresentation can be used, with positive values indicating that thevertical line is shorter, and negative values indicating that thehorizontal line is shorter. (As before, a dynamic parameter B′ can alsobe discerned to express the change in the differential scale parameter Bsince the last frame, again in two's complement, seven bit form.)

A rotation state parameter C can be discerned by the angular orientationof a line defined by two reference points on the object (e.g., centerpoints on the two side edges of a card). This parameter can be encodedas a seven-bit binary value representing the percentage of rotationaloffset in a clockwise direction from a reference orientation (e.g.,horizontal). (The two reference points must be distinguishable from eachother regardless of angular position of the object, if data in the fullrange of 0-360 degrees is to be represented. If these two points are notdistinguishable, it may only be possible to represent data in the rangeof 0-180 degrees.) As before, a dynamic parameter C′ can also bediscerned to express the change in the rotation state parameter C sincethe last frame. This parameter can be in seven bit, two's complementform, with positive values indicating change in a clockwise rotation

The foregoing analysis techniques, and representation metrics, are ofcourse illustrative only. The artisan will recognize many otherarrangements that can meet the needs of the particular Bedoopapplications being served.

In the illustrative system, the Bedoop application programs communicatewith the gestural decoding module through a standardized set ofinterface protocols, such as APIs. One API can query the gestural inputmodule for some or all of the current position parameters (e.g., any orall of X, Y, A, B, and C). The module responds to the callingapplication with the requested parameter(s). Another API can query thegestural input module for some or all of the current movement data(e.g., any or all of X′, Y′, A′, B′ and C′). Still another API canrequest the gestural decoding module to provide updated values for someor all of the position or movement data on a running basis, as soon asthey are discerned from each frame. A complementary API discontinues theforegoing operation. By such arrangement, all of the gestural data isavailable, but the Bedoop application programs only obtain theparticular data they need, and only when they ask for it.

In Bedoop applications that communicate with external servers, just theBedoop data (i.e., CLASS, DNS, and optionally UID) may initially besent. If the remote server needs to consider gestural data in decidinghow to respond, the remote server can poll the local Bedoop system forthe necessary data. The requested gestural data is then sent by thelocal Bedoop system to the remote server in one or more separatetransmissions.

In other embodiments, since the gestural data is of such low bandwidth(e.g., roughly 56 bits per image frame), it may routinely andautomatically be sent to the remote computer, so that the gesture datais immediately available in case it is needed. In an illustrativeimplementation, this data is assembled into an 8-byte packet, with thefirst byte of the packet (e.g., the X parameter) being prefixed with a“1” sync bit, and subsequent bytes of the packet being prefixed with “0”sync bits. (The sync bits can be used to aid in accurate packetdecoding.)

In some embodiments, it is useful to provide for an extension to thenormal 64-bit Bedoop length to accommodate an associated packet ofgestural data. This can be effected by use of a reserved bit, e.g., inthe UID field of the Bedoop packet. This bit normally has a “0” value.If it has a “1” value, that indicates that the Bedoop data isn't justthe usual 64 bits, but instead is 128 bits, with the latter 64 bitscomprising a packet of gestural data.

Similar extension protocols can be used to associate other ancillarydata with Bedoop data. A different reserved bit in the UID field, forexample, may signal that a further data field of 256 bits follows theBedoop data—a data field that will be interpreted by the remote computerthat ultimately services the Bedoop data in a known manner. (Such bitsmay convey, e.g., profile data, credit card data, etc.) The appendeddata field, in turn, may include one or more bits signaling the presenceof still further appended data.

Grandmothers

It is a common complaint that computers are too complex for most people.Attempts to simplify computer-user interaction to facilitate use by lessexperienced users usually serve to frustrate more experienced users.

In accordance with another embodiment of the present invention, thesophistication of a computer user is steganographically indicated on atalisman used by that user to interact with the system. The computerdetects this steganographically-encoded data, and alters its mode ofinteracting with the user accordingly.

Consider internet browser software. Experienced users are familiar withthe different functionality that can be accessed, e.g., by variousdrop-down menus/sub-menus, by the keyboard shortcuts, by the menusavailable via right-clicking on the mouse, by manipulating the rollermouse scroll wheel and scroll button, etc., etc. Grandmothers of suchusers, typically, are not so familiar.

Although gestural interfaces hold great promise for simplifyinguser-computer interaction, the same dichotomy between experienced usersand inexperienced users is likely to persist, frustrating one class ofuser or the other.

To help close this gap, a computer system according to this embodimentof the invention responds to gestures in different manners, depending onthe expertise level indicated by encoding of the talisman. For an expertuser, for example, the gestural interface active in the internet browsersoftware may display the stored list of Favorite web addresses inresponse to tipping the left edge of the talisman towards the opticalsensor. Once this list is displayed, the expert user may rotate thetalisman to the right to cause the highlighting to scroll down the listfrom the top. Rotating the talisman to the left may scroll the list ofFavorites up from the bottom. The speed of scrolling can be varied inaccordance with the degree of rotation of the talisman from a defaultorientation.

In contrast, for the novice user, these talisman manipulations may beconfounding rather than empowering. Tipping the left edge of thetalisman towards the sensor may occur as often by mistake as on purpose.For such users, a more satisfactory interface may be provided by relyingon simple X-Y movement of the talisman to move an on-screen cursor, witha movement of the talisman towards the sensor to serve as a selectionsignal (i.e., like a left-mouse click).

(In the example just-cited, the expert user summoned a list of Favoriteweb sites. Different “Favorites” lists can be maintained by thecomputer—each in association with different talismans. A husband whouses one talisman is provided a different “Favorites” list than a wifewho uses a different talisman.)

Printed Pictures

In accordance with this aspect of the invention, a printed photographcan be steganographically encoded with Bedoop data leading toinformation relating to the depicted person (e.g., contact information,biographical information, etc.).

Such a photograph can be presented to a Bedoop sensor on a telephone. Ina simple embodiment, the telephone simply processes the Bedoop data toobtain a corresponding default telephone number, and dials the number.In other embodiments, various options are possible, e.g., dial homenumber or dial work number. On presenting the photograph to thetelephone, for example, moving the photo to the left may dial the personat home, while moving he photo to the right may dial the person at work.

As telephones evolve into more capable, multi-function devices, othermanipulations can invoke other actions. In a computer/telephone hybriddevice, for example, rotating the photo counterclockwise may launch aweb browser to an address at which video data from a web cam at thepictured person's home is presented. Rotating the photo clockwise maypresent an e-mail form, pre-addressed to the e-mail address of thedepicted person. Moving the photo to the right may query a database onthe system for other photographs depicting the same individual orsubject, which can be presented in response to further user input. Etc.

In this and other embodiments, it is helpful for the Bedoop device toprompt the user to aid in manipulating the object. This can be doneaudibly (e.g., “move photo left to dial at home”) or by visual clues(e.g., presenting left- or right-pointing arrows).

Bedoop data in photographs can also be used to annotate the photographs,as with notes on the back of a photograph, or printed under thephotograph in a photo album. The Bedoop data can lead to a remotedatabase, where the photograph owner is permitted to enter a textual (oraudio) narrative in association with each photograph's UID. Years later,when some of the names have been forgotten, the photograph can bepositioned in front of a Bedoop sensor, and the system responds byproviding the annotation provided by the photograph owner years earlier.

Drivers Licenses and Other Cards

Drivers licenses, social security cards, or other identity documents maybe encoded by the issuing authority with Bedoop data that permits accessto the holder's personal records over the web. On presenting thedocument to a Bedoop system, the system directs a web browser to aprivate address corresponding to data encoded on the document. At thataddress, the holder of the document can review governmental records,such as state or federal tax return data, social security entitlements,etc., as well as privately-maintained records, such as credit records,etc. User selection among various functions can be effected by spatialmanipulation of the document. (Entry of additional data, such as socialsecurity number or mother's maiden name, may be required of the user toassure privacy in case the document is lost or stolen.)

By manipulating a driver's license in front of a Bedoop sensor, a usercan request renewal of the driver's license, and authorize payment ofthe corresponding fee.

Bank cards (debit, credit, etc.) can similarly be encoded with Bedoopdata to permit the holder to access bank records corresponding to thebank card account. (Entry of a PIN code may be required to assureprivacy.)

Such documents can also be used to access other personal data. Oneexample is e-mail. A traveler might pause at a Bedoop kiosk at anairport and present a driver's license. Without anything more, the kioskmay present email that is waiting for the traveler on an associateddisplay screen.

On recognizing a driver's license, the kiosk can access a remote site(which may be maintained by the Department of Motor vehicles, anothergovernment entity, a private entity, or by the traveler), authenticatingthe operation by presenting Bedoop data encoded on the license, andobtaining information that the person has pre-approved for release inresponse to such authorized access. This information can include e-mailaccount and password information. Using this information, the kioskqueries the corresponding e-mail server, and downloads a copy ofrecently received mail for presentation at the kiosk. (A user-enteredPIN number may be required at some point in the process, e.g., inquerying the remote site for sensitive e-mail password data, beforepresenting the downloaded e-mail for viewing, etc., to ensure privacy.)

Other cards carried in wallets and purses can also be encoded to enablevarious functions. The local sandwich shop that rewards regularcustomers by awarding a free sandwich after a dozen have been purchasedcan encode their frequent-buyer card with Bedoop data leading to theshop's web-based sandwich delivery service. Or the frequent-buyer cardcan be eliminated, and customers can instead wave their business card orother identity document in front of the shop's Bedoop sensor to getpurchase credit in a tally maintained by the sandwich shop's computer.

Food stamps, health insurance cards, and written medical prescriptions,can likewise be encoded with digital data to enable the provision of newfunctionality.

At large trade shows, such as COMDEX, vendors needn't publish thick,glossy brochures to hand out to visitors. Instead, they may printvarious stylish promo cards for distribution. When later presented to aBedoop sensor, each card leads to a web-based presentation—optionallyincluding persuasive video and other multi-media components. The usercan be prompted to provide data to customize, or focus, the presentationto the user's particular requirements. If the user wants furtherinformation, a request can be made by the click of a mouse (or the twistof a card).

Prizes and Product Promotions

Product packaging (e.g., Coke cans, Snapple bottles, Pepsi 12-packboxes) can be encoded for contest purposes. The encoding can becustomized, item to item, so that selected items—when Bedoop scanned—arerecognized to be the one in a hundred that entitles the owner to a cashor merchandise prize. A remote server to which the item's Bedoop data isprovided queries the user for contact information (e.g., address, phonenumber) so the prize can be awarded or, for smaller prizes, the systemcan print out an award certificate redeemable at local merchants forproducts or cash. Once a winning item is identified to the remoteserver, its UID on the server is marked as redeemed so that the itemcannot later be presented to win another prize.

In other such embodiments, all of the items are encoded identically.Winners are determined randomly. For example, during a contest period,persons around the world may present Coke cans to Bedoop systems. Thecorresponding Bedoop application on each user computer submits Bedoopdata to a corresponding web address. The user's e-mail address may alsobe included with the submission. As this data is relayed to thecorresponding server computer(s), every N^(th) set of data is deemed tobe a winner, and a corresponding award notification or prize isdispatched to the Bedoop system from which the winning set of dataoriginated.

The server computer that receives such contest submittals from clientBedoop systems can be arranged to prevent a single user from bombardingthe server with multiple sets of data in an attempt to win by bruteforce. (This may be done, for example, by checking the included e-mailaddress, and not considering a data submittal if the same e-mail addresswas encountered in data submitted within the past hour. Similaranti-brute-force protection can be provided on the user's computer,preventing, e.g., repeated contest data to be sent more frequently thanonce per hour. More sophisticated anti-brute-force measures can ofcourse be provided.)

Product Information and Ordering

In accordance with another embodiment of the present invention, productpackaging and product advertisements can be encoded with Bedoop datathat, when presented to a Bedoop system, initiates a link to a web pagefrom which that product can be purchased, or more information obtained.Once the link has been established, the user can be instructed tomanipulate the object in different of the earlier-described modes toeffect different functions, e.g., move towards camera to order theproduct; move away from camera for product information. If the object ismoved towards the camera to effect an order, the user can be prompted tofurther manipulate the object to specify delivery options (e.g., rotateleft for overnight mail, rotate right for regular mail). If the objectis moved away from the camera to request product information, the usercan be promoted to further manipulate the object to specify the type ofinformation desired (e.g., rotate left for recipes, rotate right for FDAnutritional information, move up for information on other products inthis family, move down to send an email to the product manufacturer).

Credit card or other customer billing information, together with mailingaddress information, can be stored in a profile on the Bedoop system,and relayed to the transactional web site either automatically when apurchase action is invoked, or after the user affirms that suchinformation should be sent (which affirmation may be signaled bymanipulation of the packaging or advertisement in one of theearlier-described modes). Other modes of payment can naturally beemployed. (One such alternative is the first-to-redeem electronic moneysystem described in the present assignee's patent application60/134,782.)

Clothing

In accordance with another aspect of the invention, clothing can beordered on-line by presenting to a Bedoop system a photograph from acatalog, or a garment tag or label. Encoded on each isproduct-identifying data, including a manufacturer ID. The Bedoop systemresponds by establishing a link to a remote computer maintained by or onbehalf of the manufacturer. In addition to relaying the productidentification data to the remote computer, the Bedoop application alsosends some or all of a clothing profile maintained by the user on thelocal computer. This profile can specify, e.g., the person's weight,height, shoe size, waist size, inseam, etc. The remote computer canconfirm availability of the identified item in the size specified in theclothing profile, and solicit payment and shipping instructions.

Computer Access Cards

This disclosure earlier considered access cards used to gain access tosecure buildings. Related principles can be used in conjunction withcomputer access.

A driver's license, employee photo ID, or other such document can bepresented to a Bedoop sensor on a computer. The computer recognizes theuser and can take various steps in response.

One response is to log onto a network. Another is to set load a userprofile file by which the computer knows how to arrange the desktop inthe user's preferred manner. By manipulating the Bedoop-encoded object,the user can further vary the environment (e.g., rotate left to launchstandard business productivity applications and software developmentapplications; rotate left to launch lunchtime diversions—stock update,recreational games, etc.)

Hotel rooms are increasingly providing computer services. By presentinga driver's license, a Bedoop-equipped computer in a hotel room can linkto a remote site indicated by the Bedoop data, obtain preference datafor that user, and launch applications on the hotel computer in anarrangement that mimics that user's familiar work computer environment.

Audio/Video Disks, Software, and Books

Bedoop data can be conveyed by indicia or texturing on the surfaces ofCD and DVD disks, on the labels (or authenticity certificates) for same,on the enclosures for same (e.g., jewel box, plastic case, etc.), onbook dust jackets, on book pages, etc. Any of these objects can bepresented to a Bedoop device to establish a link to a related web site.The consumer can then manipulate the object (or otherwise choose) toselect different options.

For music, one option is to receive MP3 or other clips of songs by thesame artist on other CDs, or of songs from other artists of the samegenre. Another is to view music video clips featuring the same artist.Still another is to order tickets to upcoming concerts by that artist.In-store kiosks can permit tentative customers to listen to sampletracks before they buy.

Similar options can be presented for video DVDs. In the case of video,this can include listings of other movies with the same director, withthe same star(s), etc. In the case of software, the options can includeadvisories, bug fixes, product updates and upgrades, etc. Naturally, theuser can make purchases from these sites, e.g., of other music by thesame artist, other videos with the same star, software upgrades, etc.

Similar options can be accessed using Bedoop data associated withprinted book materials.

Ad Tracking

Advertisers commonly use different advertisements for the same productor service, and employ means to track which ad is more effective withinwhich demographic group. Bedoop can provide such functionality.

Consider a travel service web site that is promoting Hawaiian vacations.Bedoop data from several advertisements can lead consumers to the site.

Identical advertisements can be placed in several different magazines.Each is encoded with a different Bedoop UID. By monitoring the UIDs ofthe Bedoop inquiries to the site, the travel service can determine whichmagazines yield the highest consumer response (e.g., per thousandreaders).

Likewise, within a single magazine, two or more advertisements may beencoded with Bedoop data leading to the site—again, each with adifferent UID. Again, analysis of the UIDs used in accessing the sitecan indicate which advertisement was the more effective.

The instantaneous nature of the internet links permits advertisers tolearn how consumer responses to print advertisements vary withtime-of-day, yielding information that may assist in making ads forcertain products more effective.

More elaborate variants and combinations of the foregoing are, ofcourse, possible, If the consumers provide personal information inresponse to the ads (either by permitting access to pre-stored personalprofile data, or by filling in web-based forms, or by manipulation ofthe ad (e.g., “please move the ad towards your Bedoop sensor if youdrank coffee this morning”)), still richer statistical data can begleaned.

Rolodex of Cards

Bedoop-encoded business cards as detailed above can be accumulated andkept near a telephone or computer in a Rolodex-like arrangement. If arefrigerator ice-maker malfunctions, a homeowner can find the card forthe appliance repairman used a few years ago, and present it to a Bedoopsensor. A link is established to the repairman's company (e.g., web siteor via telephone). At a web site, the repairman may provide basicinformation, such as hours of availability, current fee schedule, etc.The homeowner may select an option (by card gesture or otherwise) toinvoke a teleconference (e.g., NetMeeting) to consult about the problem.Or the homeowner may select another option to send e-mail. Still afurther option may permit the homeowner to schedule a house call on therepairman's weekly calendar. Still a further option may permit thehomeowner to view one or more short videos instructing customers how tofix certain common appliance problems.

Stored Value Cards

The earlier cited “first-to-redeem” electronic money system may encodeBedoop data on a card that leads to storage at which the random-numbertokens (which represent increments of money) are stored. Presenting thecard to a Bedoop system launches an application that reads and encryptsthe tokens and forwards the encrypted data to the clearinghouse computerof the corresponding bank to learn their remaining value. There thetokens are decrypted and checked for validity (but not redeemed). Thebank computer responds to the Bedoop system, indicating the remainingvalue of the tokens on the card.

For security reasons, the storage containing the random-number tokensshould not be generally accessible. Instead, the user must provideauthentication data indicating authorization to gain access to thatinformation. This authentication data may be a PIN code. Or the user mayprovide authentication by presenting a second Bedoop-encoded object,e.g., a driver's license to the Bedoop system. (Many other Bedoopsystems may advantageously use, or require the use of, two or moreBedoop objects—either presented one after the other, or all at the sametime. The Bedoop system can provide visual or audible prompts leadingthe user to present the further Bedoop object(s) as necessary.

Ski Lift Tickets

In accordance with another embodiment, ski lift tickets are Bedoopencoded to provide various functionality.

For example, instead of buying a lift ticket good for a day, a skier maypurchase a ticket good for eight lifts. This data is encoded on theticket, and sensed by a Bedoop sensor at each lift. The sensors arenetworked to a common server that tracks the number of lifts actuallypurchased, and updates the number as used. The skier is informed of thenumber of rides remaining on entering or leaving the lift. Statisticaldata can be collected about trail usage (e.g., N % percent of skiers skiall day along just two lifts, etc.).

Off the slopes, back at home, the used lift ticket may be presented to aBedoop sensor to obtain current snow conditions and lift hours, or toreview trail maps, or to order ski vacation packages. If the ticket isencoded with the owner's name, UID, or other information ofcommercial/marketing interest, local merchants may give the bearerdiscounts on selected goods in response to Bedoop scanning of the ticketand recovery of such information.

REI Membership Cards

Membership cards for certain stores can be Bedoop-encoded to provideadded value to the member. For outdoor gear stores such as REI,presentation of the card to a Bedoop sensor can lead to a library ofUSGS maps, to web pages with current fishing and hunting regulations,etc. Naturally, the store's on-line ordering site is just a quick twistaway.

Theme Park Tickets

Theme park tickets can be encoded with the age and gender of thevisitor, and with additional data permitting the experience to becustomized (e.g., from a roster of theme park personalities, thevisitor's favorite is Indiana Jones). Throughout the park are kiosks towhich the visitor can present the ticket to orchestrate the visit tofollow a particular story line. Some kiosks issue premiums matching theage/gender of the recipient.

Car Keys

In accordance with another embodiment of the invention, car keys (or keyring fobs) are Bedoop encoded. When the car is taken to a shop forservice, the mechanic presents the key to a Bedoop sensor, and therebyobtains the car's maintenance history from a remote server on which itis maintained. At home, the key can be presented to a Bedoop sensor andmanipulated to navigate through a variety of automotive-related websites.

In some embodiments, the Bedoop-encoded object is not used to navigateto a site, but is instead used to provide data once a user's computer isotherwise linked to a web site. A user surfing the web who ends up at acar valuation site can present a key to the Bedoop scanner. The Bedoopdata is used to access a remote database where the make, model, options,etc., of the car are stored. This data is provided to a database enginethat returns to the user the estimated value of the car.

While visiting a mechanic's web site, presentation (and optionallymanipulation) of a key or key ring fob can be employed to schedule aservice appointment for the car.

Fashion Coordination

Some department stores and clothing retailers offer “personal shoppers”to perform various services. For example, a customer who is purchasing adress may ask a personal shopper for assistance in selecting shoes oraccessories that complement the dress.

A Bedoop-encoded garment tag on the dress can be employed to obtainsimilar assistance. In response to such a tag, a Bedoop system can querya database to obtain a mini-catalog of clothes and accessories that havepreviously been identified as complementing the dress identified by thetag. These items can be individually displayed on a screen associatedwith the system, or a virtual model wearing the dress—together with oneor more of the recommended accessories—can be synthesized and depicted.The shopper may quickly review the look achieved by the model wearingthe dress with various different pairs of shoes, etc., by repeatedlyactivating a user interface control (by mouse, touch screen, or garmenttag gestures) to cycle through different combinations.

A shopper's credit card can be Bedoop-encoded so as to lead Bedoopsystems of particular stores (i.e., stores pre-authorized by theshopper) to a profile on the shopper (e.g., containing size information,repeat purchase information, return history, style/color preferences,etc.).

Credit Card Purchases

When a consumer visits a commercial web site and wishes to purchase adisplayed product, the transaction can be speeded simply by presenting aBedoop-encoded credit card to a Bedoop sensor on the user's computer.The Bedoop data on the card leads to a database entry containing thecredit card number and expiration date. The Bedoop application thensends this information (optionally after encrypting same) to the website with instructions to purchase the depicted product.

(Impulse purchases are commonly deterred by the hurdles posed betweenthe purchase impulse and the completed purchase. This and other Bedoopapplications aid in reducing such hurdles.)

Product Marketing

Bedoop data relating to one product or service can be used tocross-market others products and services. Consider a consumer whopurchases a pair of golf shoes. The box is Bedoop encoded. By presentingthe box to a Bedoop system, the consumer is linked to a web page thatpresents various promotional offers. The consumer may, for example,elect to play a free round of golf at one or more identified local golfcourses, or print a coupon for ten percent off any order of socks froman on-line sock merchant. (Various means can be employed to preventmultiple redemptions from a single box. One is a serial number that istracked by the web page or cross-marketed merchant, and only honoredonce. Another is identification data corresponding to the consumer thatis tracked to prevent multiple redemptions.)

Product tags can likewise be Bedoop-encoded. A tag from an article ofNike apparel can lead to the Nike on-line store, where the user can buymore merchandise. If the tag is from a soccer jersey, a certain tagmanipulation (e.g., rotate left) may lead the user to a special-interestsoccer page, such as for the World Cup. A tag on a golf glove may leadto a website of a local golf course. Twist left to reserve a tee time;twist right to review course maps and statistics. Bedoop kiosks can beprovided in retail stores to let consumers use the Bedoop features.

Travel Planning Services

After making a reservation at a resort, a consumer is typically mailed(by email or conventional mail) various confirmation information. If notalready printed, the consumer can print this information (e.g., aconfirmation card).

Bedoop-encoding on the printed object can lead to web-based informationrelating to the reservation (e.g., reservation number, the consumer'sname, arrival/departure dates, etc.). If the consumer wishes to makedinner or golf reservations, this object is presented to a Bedoopsystem—either at the user's home, at an airport kiosk, etc. The systemrecognizes the object type and encoded data, and establishes a link to aremote computer that provides various information and schedulingservices for the resort. By manipulating the object (or otherwise) theconsumer selects desired dinner and golf tee times. The system alreadyhas the reservation number (indexed by the UID), so tedious provision ofsuch data is avoided.

In some embodiments, the remote computer is not maintained by theresort, but is rather maintained by an independent travel service. (Thetravel service may also maintain the DNS leaf node server.) The computercan present a web page (branded by the travel service or not) thatoffers the scheduling options desired by the user, and also presentslinks to other information and services (e.g., offering entry tickets tonearby attractions, and advertising nearby restaurants).

Airline tickets (or e-ticket confirmations) can be similarly encodedwith Bedoop data. These items may be presented to Bedoop systems—at atraveler's home or in airports—to permit review and changing of travelitinerary, reserve hotels and rental cars, secure first-class upgrades,check the airplane's seating arrangement, review frequent flier status,scan tourist information for the destination, etc.

Movie Tickets

As indicated earlier, movie tickets can be encoded with Bedoop dataidentifying, e.g., the movie title and date. When a movie viewer returnshome, the ticket stub can be presented to a Bedoop system. One of theoptions presented by the corresponding Bedoop application can be tolaunch a pay-per-view screening of the just-seen movie at a discountedrate. Another is to download the movie onto a writeable DVD disk at theviewer's home, perhaps serialized to permit playback only on thatviewer's DVD player, or enabled for only a few playbacks, etc. (again,likely for a discounted fee). Still another option is to presentweb-delivered video clips from the movie. Another is to offer relatedmerchandise for purchase, possibly at discount to retail. (Thesefeatures may be available for only a limited period after the dateencoded on the ticket stub.) Another is to alert the consumer toupcoming movies of the same genres, or with the same director or stars,or released by the same studio. Still another is to direct a web browserto an on-line ticket merchant for tickets to other movies. The consumermay navigate among these options by manipulating the ticket stub, orotherwise.

The same, or related, options can likewise be provided in response toBedoop data detected from a book jacket presented to a Bedoop system.

Video Recording

A video recording device can be programmed to record a broadcast programby presenting a Bedoop sensor with a printed promotion for the program(e.g., an advertisement in a newspaper or TV Guide). Bedoop-encodedwithin the printed document is data by which the Bedoop system (whichmay be built into the video recorder or separate) can set the recordingtime, date, and channel.

Set Top Boxes

Many entertainment-related applications of Bedoop data can beimplemented using television set top boxes. Such boxes includeprocessors, and typically include a return channel to a controlfacility. The provision of a Bedoop chip and optical sensor can vastlyincrease the functionality these devices presently provide.

Special Event Tickets

Consider a ticket to a basketball game. By presenting the ticket to aBedoop system, a user may access the web site of either team so as toreview recent scores and statistics. The user may also obtain aweb-based virtual tour of the arena, and review seating maps. Ticketsfor upcoming games may be ordered, as well as pay-per-view games andteam souvenirs. For high-priced tickets, the user may be entitled topremium web features, such as on-line text-, audio-, or video-chatsession with a team star on the day before the game.

Unlike conventional tickets, Bedoop-encoded tickets need not limit theuser to a predetermined seat. While the ticket may be printed with anominal seat, the user may present the ticket to a Bedoop sensor andaccess a web site at which a different seat can be reserved. Onattending the event, the consumer presents the ticket to a Bedoop sensorthat reads the ticket UID and looks up the seat assignment most-recentlypicked by the consumer. It then prints a chit entitling the consumer totake the seat earlier selected from the transactional web site.

Signet Rings

Signet rings have historically been used to indicate a person's identityor office. Such rings, or other items of personal jewelry, can beencoded with Bedoop data (either by texturing or printing) and presentedas necessary to Bedoop systems. The extracted Bedoop data can lead to asecure web site indicating the person's name and other information(i.e., a web site that has anti-hacking measures to prevent illicitchange of the stored identification information). Such a signet ring canbe presented to Bedoop systems that require a high-confidenceconfirmation of identity/authorization before proceeding with a Bedoopfunction.

Post-It® Notes

Pads of Post-It® notes, or other pads of paper, can be marked by themanufacturer (either by texturing, watermarked tinting, ink-jetspattering, etc.) to convey steganographic data (e.g., Bedoop data).When such a note is presented to a Bedoop system, the system may launchan application that stores a snapshot of the note. More particularly,the application may mask the note-portion of the image data from theother image data, virtually re-map it to a rectangular format ofstandardized pixel dimensions, JPEG-compress the resulting image, andstore it in a particular computer subdirectory with a name indicatingthe date of image acquisition, together with the color and/or size ofthe note. (These latter two data may be indicated by data included inthe Bedoop payload.) If the color of the note is indicated by digitaldata (e.g., in the file name), then the image itself may be stored ingrey-scale. When later recalled for display, the white image backgroundcan be flooded with color in accordance with the digital color data.

The Bedoop system may buffer several past frames of image data. When theobject is recognized as a Post-It note whose image is to be saved, thesystem may analyze several such frames to identify the one best-suitedfor storage (e.g., check the spatial frequency content of the note asimaged in each frame, to identify the one with the finest detail), andstore that one.

When a Post-It note is recognized by the Bedoop system, the system mayemit a confirmation tone (or other response) to indicate that the objecthas been recognized, but not immediately execute the snapshot operation.Instead, the system may await a further instruction (e.g., gesture) toindicate what operation is desired.

By moving the note towards the sensor, for example, the user can signalthat a snapshot operation is to be performed. (This closer presentationof the note may also permit the imaging system to capture a moredetailed frame of image data.) By moving the note away, the system mayrespond by reading, decompressing, and displaying the six most-recentlystored Post-It note images, in tiled fashion, on the computer screen.The individual notes can be displayed at their original dimensions, oreach can be re-sized to fill the full height or width of a tile. A userinterface control (responsive to gestures, mouse operation, keyboardscroll arrows, etc.) allows the user to scroll back in time to anydesired date.

The full 64-bit Bedoop payload of other embodiments may not be neededfor Post-It notes. In the just-given example, for example, the Bedoopsystem responds to all Post-It notes in the same fashion. Thus, anabbreviated Bedoop format that indicates simply ‘I'm a Post-It note,yellow, size 3″×3″’ can suffice. The twelve bit CLASS ID, with eightfurther bits to indicate color/size combinations, may be sufficient.Reducing the payload permits it to be more robustly encoded on smallobjects. (As noted below, Bedoop decoding systems can look for severaldifferent data formats/protocols in trying to extract Bedoop data froman object.)

Alignment of Documents for Other Purposes

While the just-described pre-marked paper triggered a Bedoop responsewhen presented to a Bedoop sensor (i.e., take a snapshot of the paper),the markings can be used for purposes other than to trigger Bedoopresponses.

Regardless of the particular data with which the paper is encoded, theembedded subliminal graticules, or other steganographically-encodedregistration data, can be used by other applications to correctmisalignment of scanned data. In a photocopier, for example, a documentneed not be placed exactly squarely on the glass platen in order toyield a properly-aligned photocopy. The scanner scans the skeweddocument and then detects the steganographic registration markings inthe resulting scan data. This data is then processed to virtuallyre-register same, so that the registration markings are in a desiredalignment. The processed scan data is then provided to the xerographicreproduction unit to yield a photocopy in which the skew effect isremoved.

The same technique is likewise applicable to video recorders, digitalcameras, etc. If such a device images an object (e.g., a photograph)with steganographic registration markings, these markings can be used asa guide in re-registering the resulting data to remove mis-alignmenteffects.

Postal Mail Information

Many contexts arise in which data to be presented to a consumer isvaluable only if timely. The postal service mail is ill-suited for somesuch information due to the latency between printing a document, and itsultimate delivery to a recipient. Bedoop principles, however, allow therecipient to take a postal object that was printed well before delivery,and use it on receipt (i.e., present to a Bedoop system) to receiveup-to-the-minute information. In this and other embodiments, the Bedoopdata can also uniquely identify the addressee/recipient/user, so the website can present data customized to that user.

Distributors of printed advertising can reward Bedoop-driven consumervisits to their web sites by issuing digital tokens or coupons that canbe redeemed for premiums, cash-back, etc. Every millionth visitor wins amillion pennies (with appropriate safeguards, e.g., preventing more thanone entry an hour).

Classes of Bedoop Encoding

The above-described embodiments focused on use of Bedoop data afterdecoding. Additional insight may be gained by examining the earlier partof the process—encoding.

Encoding can be performed in many contexts, which may be conceptualizedas falling into three broad classes. The first is static marking, inwhich a document designer, pre-press service bureau, advertising agencyor the like embeds Bedoop data. The second is dynamic marking, in whichautomated systems encode, or vary, Bedoop data “on the fly.” Suchsystems can tailor the Bedoop data to particularly suit the context,e.g., to the moment, place, user, etc. The third is consumer marking, inwhich Bedoop data is added to a document at the time of printing.

The second class of encoding enables features not available from thefirst. Consider an American Express travel web page with informationabout travel to Hawaii. A DNS leaf node server points to this page inresponse to certain Bedoop data—e.g., data encoded in a magazinephotograph of a Hawaiian beach scene.

Actually, all Bedoop data having a certain CLASS and DNS ID may lead tothis web page, irrespective of the UID data. If the magazine photo isencoded with a particular “don't care” UID field (e.g.,111111111111111111111111), this may signal the originating Bedoopsystem—or any intervening system through which the Bedoop datapasses—that arbitrary data can be inserted in the UID field of thatBedoop packet. The originating Bedoop system, for example, can insert adynamically-configured series of bits into this field. Some of thesebits can provide a profile of the user to the remote server, so that theBedoop response can be customized to the user. (The user would naturallypre-approve information for such use so as to allay privacy concerns.)

As one example, the local Bedoop system can set the least significantbit of the UID field to a “0” if the user is male, or to a “1” if theuser is female. The next four bits can indicate the user's age by one ofsixteen age ranges (e.g., 3 or less, 4-5, 6-7, 8-9, 10-11, 12-13, 14-15,16-17, 18-20, 21-24, etc.).

Alternatively, or in addition, the local Bedoop system can stuff thedon't-care UID field (all of it, or in part) with signature data tendingto uniquely identify the local Bedoop system (e.g., system serialnumber, a hash code based on unchanging data unique to that system,etc.) By reference to such data, the remote server can identify repeatvisits by the same user, and can tailor its responses accordingly (e.g.,by recalling a profile of information earlier entered by the user andstored at the remote server, avoiding the need for data re-entry).

More on Optical Input Devices

It is expected that image input devices will soon become commonplace.The provision of digital cameras as built-in components of certaincomputers (e.g., the Sony Vaio laptops) is just one manifestation ofthis trend. Another is camera-on-a-chip systems, as typified by U.S.Pat. No. 5,841,126 and detailed in Nixon et al., “256×256 CMOS ActivePixel Sensor Camera-on-a-Chip,” IEEE J. Solid-State Circuits, Vol.31(12), pp. 2046-2051 (1996), and Fossum, “CMOS Image Sensors:Electronic Camera-on-a-Chip,” IEEE Transactions of Electron Devices,vol. 44, No. 10, October 1997. Still another is head-mounted cameras (asare presently used in some computer-augmented vision systems). These andother image input devices are all suitable for use in Bedoop systems.

Camera-on-a-chip systems can be equipped with Bedoop detector hardwareintegrated on the same chip substrate. This hardware can be arranged tofind and decode Bedoop data from the image data—notwithstanding scale,rotation, differential scaling, etc. Gestural decoding can also beprovided in hardware, with the resulting data output in packet form on aserial output bus. Such a chip can thus provide several outputs—imagedata (either in raw pixel form, or in a data stream representing theimage in one of various image formats), 64 bits of Bedoop data (seriallyor in parallel), and decoded gesture data.

In other embodiments, the Bedoop detector (and/or the gestural decoder)can be on a substrate separate from the camera system.

To accommodate different Bedoop data formats and protocols, the hardwarecan include RAM or ROM in which different format/protocol information isstored. (These different formats/protocols can relate, e.g., to Bedoopsystems employing different data payload lengths, different subliminalgrids, different encoding techniques, etc.) As the Bedoop system staresout and grabs/analyzes frames, each frame can be analyzed in accordancewith several different formats/protocols to try and find aformat/protocol that yields valid Bedoop output data.

Movable Bedoop Sensors

Although the illustrated Bedoop systems are generally stationary, theyneed not be so. They can be portable. Some such systems, for example,employ palmtop computers equipped with optical sensor arrays. If thepalmtop is provided with live network connectivity (e.g., by wireless),then Bedoop applications that rely on remote computers can beimplemented just as described. If the palmtop is not equipped with livenetwork connectivity, any Bedoop applications that rely on remotecomputers can simply queue such communications, and dispatch same whenthe palmtop next has remote access (e.g., when the palmtop is nextplaced in its recharger and is coupled to a modem through which internetaccess can be established).

Another variant is a Bedoop sensor that is movable around a desk orother work-surface, like a mouse. Such a sensor can be coupled to theassociated computer by cabling, or a wireless interface can be used. Theperipheral may be arranged for placement on top of an item in order toread digital data with which the object is marked. (Built-inillumination may be needed, since the device would likely shadow theencoding.) Some forms of such peripherals are adapted to serve both asgeneral purpose digital cameras, and also as Bedoop sensors.

Such a peripheral would find many applications. In “reading” a magazineor book, for example, it may be more intuitive to place a Bedoop reader“on” the object being read, rather than holding the object in the air,in front of a Bedoop sensor. This is particularly useful, e.g., when amagazine page or the like may have several differently-encoded Bedoopsections (corresponding to different articles, advertisements, etc.),and the user wants to assure that the desired Bedoop-encoded section isread.

The “bookmark” paradigm of internet browsers might be supplemented withpaper bookmarks, e.g., Bedoop data encoded on one or more pages ofpaper. To direct a browser to a particular bookmarked destination, theperipheral is simply placed on top of the page (or part thereof) that ismarked with the corresponding Bedoop data. A user may print a “map”comprised of postage stamp-sized regions tiled together, each of whichregions represents a favorite web destination.

Such a map may be printed on a mouse pad. Indeed, mouse pads withcertain maps pre-encoded thereon may be suitable as promotionalmaterials. A company may offer to print a family photograph on such apad. Encoded within the photograph or the pad texture are addresses ofweb sites that have paid a fee to be accessible in this manner on auser's desk.

Like mice—which are provided with buttons, roller wheels, and rollerbuttons in addition to X-Y encoders—movable Bedoop encoders can likewisebe provided with auxiliary switches and roller inputs to complement thedata input provided by the optical sensor. Indeed, some embodimentsintegrate the functions of Bedoop peripheral with a mouse. (Theundersides of mice are generally under-utilized, and can readily beequipped with an image sensor.)

Gestural input can readily be provided by such a peripheral—in thiscontext moving the sensor rather than the object.

Watermarking Techniques There are nearly as many techniques for digitalwatermarking (steganographic data encoding) as there are applicationsfor it. The reader is presumed to be familiar with the great variety ofmethods. A few are reviewed below.

The present assignee's prior application Ser. No. 09/127,502, filed Jul.31, 1998 (now U.S. Pat. No. 6,345,104) shows techniques by which veryfine lines can be printed on a medium to slightly change the medium'sapparent tint, while also conveying digital data. Commonly-ownedapplication Ser. No. 09/074,034, filed May 6, 1998 (now U.S. Pat. No.6,449,377), details how the contours of printed imagery can be adjustedto convey digital data. (That technique can be applied to printed textcharacters, as well as the line art imagery particularly considered.)The assignee's U.S. Pat. No. 5,850,481 details how the surface of paperor other media can be textured to convey optically-detectable binarydata. The assignee's U.S. Pat. Nos. 5,841,886 and 5,809,160 detailvarious techniques for steganographically encoding photographs and otherimagery.

Some watermarking techniques are based on changes made in the spatialdomain; others are based on changes made in transformed domains (e.g.,DCT, wavelet). Watermarking of printed text can be achieved by slightvariations to character shape, character kerning, line spacing, etc.

Data glyph technology, as detailed in various patents to Xerox, isusable in many of the applications detailed herein.

The foregoing is just a gross under-sampling of the large number ofwatermarking techniques. The artisan is presumed to be familiar withsuch art, all of which is generally suitable for use in the applicationsdetailed herein.

More generally, essentially any data encoding method that permitsrecovery of the encoded data from optical scan data can be employed. Barcodes (1D and 2D) are but the most familiar of many suchoptically-detectable data encoding techniques.

Conclusion

Having described and illustrated the principles of our invention withreference to illustrative embodiments, it should be recognized that theinvention is not so limited.

For example, while certain of the embodiments were illustrated withreference to internet-based systems, the same techniques are similarlyapplicable to any other computer-based system. These includenon-internet based services such as America Online and Compuserve,dial-up bulletin board systems, etc. Likewise, for internet-basedembodiments, the use of web browsers and web pages is not essential;other digital navigation devices and other on-line data repositories canbe similarly accessed.

Similarly, while the details of the preferred Bedoop system wereparticularly given, the underlying principles can be employed innumerous other forms.

For example, one other form is to steganographically encode physicalobjects with Digital Object Identifiers (DOIs). The Center for NationalResearch Initiatives and the Digital Object Identifier Foundation(www.doi.org) have performed extensive work in establishing aninfrastructure by which digital objects can be distributed, tracked, andmanaged. Some of this same infrastructure and technology can be adapted,in accordance with the teachings provided above, to associate newfunctionality with physical objects.

Another form is not to reference a remote data repository by dataembedded on an object, but instead to encode the ultimate data directlyon the object. A photograph, for example, can be literally encoded witha telephone number. On presenting the photograph to an optical sensor onthe telephone, the telephone can analyze the optical information toextract the telephone number, and dial the number, without the need forany external data. Similarly, a printed office document (e.g.,spreadsheet) can be encoded with the path and file name of thecorresponding electronic file, obviating the need for indirect linking(e.g., to a database to correlate a UID to a computer address). Most ofthe above-described embodiments are suitable for such direct encoding ofthe related data.

In the business card example given above, the detailed techniques can besupplementary to existing optical character recognition techniques. Thatis, the image data from an optical sensor can be applied both to aBedoop decoder and to an OCR system. Text characters discerned by theOCR system can be entered directly into a contacts manager personaldatabase. The techniques employed in the Bedoop system to locate theencoded object and handle visual distortion (e.g., the visual artifactsdue to scale, rotation, etc.) can advantageously be used in OCRdetection as well, permitting extraction of the OCR information withoutcareful placement of the card.

While certain of the foregoing embodiments made reference to ink-jetprinting, similar advantages can often be obtained with other printingtechnologies, e.g., laser/xerographic printing, offset printing, etc.

In the foregoing embodiments, Bedoop decoding generally proceeded fromimage data obtained from a physical object. However, in some contexts,it is advantageous to Bedoop-decode image data provided electronically,e.g., over the internet.

Likewise, while the foregoing embodiments generally relied on Bedoopimage sensors that stared out for an object at an expected point, inalternative embodiments, sensors that seek rather than stare can beemployed (as was illustrated above in connection with the elevatorexample)

Similarly, while the illustrated embodiments generally employed sensorsthat repeatedly grabbed frames of image data, this need not be the case.Single frame systems, such as flatbed scanners, and video systemsarranged to grab single frames—with or without TWAIN interfaces—canalternatively be used.

As indicated above, while steganographic encoding of the digital data isused in the preferred embodiments, visible forms of digitalencoding—such as bar codes—can naturally be employed where aestheticconsiderations permit.

In certain of the embodiments, digital data conveyed by means other thanoptical can be used. Electromagnetic detection (e.g., of the sort usedin proximity-based card-access systems) can be arranged to decodedigital data, permitting “at-a-distance” reading of data from physicalobjects, just as in the foregoing embodiments.

Since the Bedoop image sensors typically acquire plural frames of data,the extraction of the digital data can be based on more than a singleimage frame. More confidence in the results may be accumulating decodeddata over several frames. Moreover, movement of the object within thesensor's field of view may permit the system to acquire information fromother perspectives, etc., enhancing system operation.

While the preferred embodiments employ 2-D image sensors (e.g., CCDs),other optical sensing technology can alternatively be employed.Supermarket laser scanners, for example, can read bar-code data.Raster-scanning of such systems can permit acquisition of 2-D data(either in bit-mapped form, or grey-scale).

While the illustrated embodiments used a 12/24/24 bit protocol forCLASS/DNS/UID data, other arrangements can of course be used. In someapplications it is advantageous for the protocol to more nearly matchthose commonly used for internet communications. For example, IPaddresses for internet Domain Name Servers (DNS) are presently 32 bits,with extension to 64 or 128 bits foreseen in the near future. The DNSfield in Bedoop systems can be follow the internet standard.

Some embodiments can advantageously employ texture-based Bedoop encodingof objects. Bedoop texturing can be effected by various means, includingpressure rollers, chemical or laser etching, etc.

While the foregoing embodiments have generally employed planar objectsto convey the digital encoding, this need not be the case. Objects ofother shapes can likewise be employed. Some shapes present relativelystraightforward image processing tasks. Data imaged from a soft drinkcan or other cylindrical surface, for example, is fairly easy to remapusing known geometrical transforms so as to essentially “unwrap” theprinting from the can. Other geometries can present more complexre-mappings, but are likewise generally within the capabilities of theartisan. (Such remapping is facilitated by encoding in the data certainreference markings, such as subliminal graticules, etc. The unknown 3Dshape of the object being imaged can usually be inferred from theapparent warping of the reference markings in the 2D image datagenerated by the scanner. Once the warping is characterized, it isgenerally straightforward to un-warp so as to prepare the image data fordecoding.)

It was once popular to predict that paper documents would be replacedwith electronic media. In hindsight, electronic media may be recognizedas a poor surrogate for paper. Electronic media conveys informationflawlessly, but is lacking in experiential attributes. We can holdpaper, stack it, own it, deface it, give it, guard it, etc. It providesan opportunity for physical dominion entirely lacking with electronicmedia.

From the foregoing discussion it can be seen that, rather than replacingpaper with electronic media, perhaps the future lies in giving paperdigital attributes—hybridizing the physical experience of paper with thetechnical advantages of digital media. Such an arrangement makesavailable a great wealth of new functionality, now accessible throughfamiliar paper items, rather than through a “computer input peripheral.”

To provide a comprehensive disclosure without unduly lengthening thisspecification, applicants incorporate by reference the patents,applications, and publications identified above.

In view of the many embodiments to which the principles of our inventionmay be applied, it should be recognized that the detailed embodimentsare illustrative only and should not be taken as limiting the scope ofour invention. Rather, we claim as our invention all such embodiments asfall within the scope and spirit of the following claims, andequivalents thereto.

APPENDIX A Application 60/134,782 FIELD OF THE INVENTION

The present invention relates to applications of digital watermarking inconjunction with audio, video, imagery, and other media content.

BACKGROUND

Watermarking (or “digital watermarking”) is a quickly growing field ofendeavor, with several different approaches. The present assignee's workis reflected in U.S. Pat. Nos. 5,841,978, 5,768,426, 5,748,783,5,748,763, 5,745,604, 5,710,834, 5,636,292, 5,721,788, and laid-open PCTapplications WO97/43736 and WO99/10837. Other work is illustrated byU.S. Pat. Nos. 5,734,752, 5,646,997, 5,659,726, 5,664,018, 5,671,277,5,687,191, 5,687,236, 5,689,587, 5,568,570, 5,572,247, 5,574,962,5,579,124, 5,581,500, 5,613,004, 5,629,770, 5,461,426, 5,743,631,5,488,664, 5,530,759, 5,539,735, 4,943,973, 5,337,361, 5,404,160,5,404,377, 5,315,098, 5,319,735, 5,337,362, 4,972,471, 5,161,210,5,243,423, 5,091,966, 5,113,437, 4,939,515, 5,374,976, 4,855,827,4,876,617, 4,939,515, 4,963,998, 4,969,041, and published foreignapplications WO 98/02864, EP 822,550, WO 97/39410, WO 96/36163, GB2,196,167, EP 777,197, EP 736,860, EP 705,025, EP 766,468, EP 782,322,WO 95/20291, WO 96/26494, WO 96/36935, WO 96/42151, WO 97/22206, WO97/26733.

Most of the work in watermarking, however, is not in the patentliterature but rather in published research. In addition to thepatentees of the foregoing patents, some of the other workers in thisfield (whose watermark-related writings can by found by an author searchin the INSPEC database) include I. Pitas, Eckhard Koch, Jian Zhao,Norishige Morimoto, Laurence Boney, Kineo Matsui, A. Z. Tirkel, FredMintzer, B. Macq, Ahmed H. Tewfik, Frederic Jordan, Naohisa Komatsu, andLawrence O'Gorman.

The artisan is assumed to be familiar with the foregoing prior art.

In the present disclosure it should be understood that references towatermarking encompass not only the assignee's watermarking technology,but can likewise be practiced with any other watermarking technology,such as those indicated above.

Watermarking has various uses, but the present specification detailsseveral new uses that provide functionality and features not previouslyavailable.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 is a diagram showing the participants, and channels, involved inthe distribution of music.

FIG. 2 shows a conceptual model of how music artists, record labels, andE-Music distributors can all interact with a Media Asset ManagementSystem, of which several are detailed in the following specification.

DETAILED DESCRIPTION

For expository convenience, much of the following discussion focuses onmusic, but the same principles and techniques are largely or whollyapplicable to other source data, whether non-music audio, video, stillimagery, printed materials, etc.

Music Asset Management

Referring to the figures, the music distribution process begins with acreative artist 10. The artist's music has traditionally beendistributed by a record label 12. (While the following discussion refersto distribution through such a label, it should be understood that suchdistribution can just as well be effected directed under the artist'scontrol, without a record label intermediary.)

In traditional distribution 14, the record label produces tangiblemedia, such as records, tapes, videos (e.g. music videos), and CDs 16.These media are physically distributed to end-consumers 18.Additionally, the label 12 distributes the music media to outlets 20,such as radio and TV stations, cable and satellite systems, etc., whichbroadcast (or narrowcast) the artist's work to an audience. Distributionthrough such media outlets may be monitored by playout trackingservices. Playout tracking data, collected by firms including Arbitron,Nielsen, ASCAP, BMI, etc., can be used to compute royalty payments, toverify broadcast (e.g. for advertising), etc.

Increasingly, the distribution of the music to the media outlets isperformed electronically. Such distribution first took the form ofanalog audio over high quality landlines or satellite channels. Digitalaudio quickly supplanted analog audio in such distribution channels dueto higher fidelity.

More recently, distribution of the music from the record labels to themedia outlets has occurred over secure links, now including theinternet. Such security was first provided simply by scrambling theaudio signal or data. More sophisticated “container”-based systems arenow coming into vogue, in which the audio is “packaged” (often inencrypted form) with ancillary data.

Electronic distribution of music to the consumer is also gainingpopularity, presently in the MP3 format primarily. The music providersmay deal directly with the public, but more commonly effect suchconsumer distribution through a newly emerging tier of digital mediaoutlets, such as internet sites that specialize in music. From suchsites, consumers can download digital audio files into personal digitalaudio players. (The Diamond Rio, and the Audible MobilePlayer devicesare some of the first of what will doubtless be a large number ofentrants into this personal internet audio appliance market.) Or thedownloaded data can be stored by the consumer-recipient onto any otherwriteable media (e.g. hard disk, CD, DVD, tape, videotape, etc.).Typically a personal computer is used for such downloading, but thisintermediary may be dispensed with by coupling next generation ofpersonal audio appliances to an internet-like link.

The data downloaded by the consumer can be stored either in the nativedigital format, translated into another digital format (whichtranslation may include decryption), converted into analog and recordedin analog form, etc.

Unauthorized copying or use of the music can occur anywhere in theforegoing channels. However, one of the greatest risks occurs once themusic has been delivered to the consumer (whether by tangible media, bytraditional broadcast media outlets, by emerging digital distribution,or otherwise).

The general idea of embedding auxiliary data into music (i.e.watermarking) has been widely proposed, but so far has been of limitedapplicability.

For example, GoodNoise is planning to embed a digital signature—termed amultimedia identifier, or MMI—in its MP3 music. MMI will register thesong and its author with a licensing number. In addition to providinginformation about the songwriter and distributor, this digital encodingmay also include lyrics, liner notes, and other information. But all ofthe proposed uses serve only to convey information from the distributorto the consumer; use for “tracking” is actively disclaimed. (Wired News,“GoodNoise Tags MP3 Files,” Feb. 3, 1999.)

The Genuine Music Coalition—a partnership of various companies in themusic distribution business—likewise has announced plans to employwatermarking of MP3 music. The watermarking technology, to be providedby Liquid Audio, will convey data specifying the artist or producercontact, copyright data, and a number to track ownership. The Coalitionhopes that the provision of this embedded information will help thwartpiracy. Industry observers believe Liquid Audio will next introduceplayback technology only plays audio in which its watermark is detected.(Wired News, “Liquefying MP3,” Jan. 23, 1999)

A similar initiative has been announced by the Recording IndustryAssociation of America (RIAA). Termed the Secure Digital MusicInitiative (SDMI), the program seeks to define a voluntary specificationthat will assure proper compensation to those who produce and distributemusic. One element of the system will likely be a watermarkingcomponent. (Dow Jones Newswire, “Spurred By Maverick Technology, MusicIndustry Eyes Web,” Dec. 31, 1998.)

Yet another initiative has been announced by Solana and ASCAP. Othercompanies promoting watermarking for music include Aris Technology,MCY.com, and AudioSoft.

The watermark payload can represent various types of data. An exemplarypayload includes data relating to the artist, distribution entity,title, and copyright date/proprietor. Additionally, the payload caninclude a digital object identifier—an ISBN-like number issued by acentral organization (e.g. a rights management organization) to uniquelyidentify the work.

Such payload data can be encoded literally (e.g. the title by a seriesof ASCII characters, etc.). In other embodiments, codes or abbreviationscan be employed—with each code having a known meaning. In still otherembodiments, the data can be meaningless by itself, but may serve as akey (e.g., a Unique Identifier, or UID) into a remote data database orrepository. An example of such a remote data repository is a web site ata Master Global Address (MGA) associated with content, as detailedbelow.

An exemplary data payload may, for example, have the following format: AB C D E F G H IWhere A is a six-byte (8-bits to a byte) ASCII string serving as adigital object identifier (which may serve as a link to a Master GlobalAddress through a default name server, as discussed below), B is atwo-byte ASCII field serving as a key into an “artist” field of theremote database, C is a three-byte ASCII field serving as a key into a“title” field of the remote database; D is a 14-bit field serving as akey into a “label” field of the remote database, E is an 8-bit integerrepresenting the work's year of first publication (with 0 representingthe year 2000); F is a 10-bit field serving as a key into a “price”field of the remote database, G is a two-byte usage control string(detailed below), H is a streaming data channel, and I is a string ofbits serving as a cyclic redundancy checksum for the foregoing. (Moresophisticated error correcting checksums can, of course, be employed.)This payload format totals 136 bits, exclusive of the CRC coding and thestreaming data channel.

This payload is encoded repeatedly, or redundantly through the music, sothat the full payload can be decoded from partial excerpts of the music.

The encoding is also desirably perceptually adaptive, so that higherenergy encoding is employed where the listener is less likely toperceive the additional “noise” introduced by the encoding, and viceversa. Various techniques for perceptually adaptive encoding are known.For example, some tie the amplitude of the encoded signal to theinstantaneous amplitude of the music. Others exploit psychoacoustic“masking” of one signal by a spectrally-or temporally-adjoining signalof higher energy. Still other approaches fill gaps in the music'sspectrum with watermark energy. These and other techniques are detailedin the patents incorporated by reference.

In other embodiments, perceptually adaptive encoding is not used. Insome such embodiments, no tailoring of the temporal or spectralcharacteristics of the watermark signal is employed. In others, thewatermark signal is spectrally filtered to emphasize low frequency audiocomponents (e.g. less than 500 hz), high frequency audio components(e.g. higher than 2500 hz), or mid-frequency audio components (500-2500hz).

The streaming data field channel (H) is a medium by which data can beconveyed from a distribution site (or other site) to the end user. Suchdata may be entirely unrelated to the underlying work. For example, itmay serve a utilitarian purpose, such as conveying data to a memory inthe consumer device to replace previously-stored data that isout-of-date. It may be a commercial channel on which bandwidth is soldfor access to the consumer or the consumer's device. Essentially anypurpose can be served by this streaming data field. Unlike most of theother fields, the streaming data field may not endlessly repeat the samedata, but can convey data that changes with time.

Desirably, the encoding is performed in a manner permitting recovery ofthe watermark data even if the audio is corrupted, e.g. by formatconversion, re-sampling, tape wow and flutter, compression, coding, orvarious forms of audio processing (e.g. filtering, pre-emphasis,re-scaling, etc.). One way to provide for such robustness is to encode asignal of known character that can be recognized through all suchcorruption. By identifying such known signal, the watermark signal canthen be decoded. (The known signal can take various forms, e.g. asynchronization signal, a marker signal, calibration signal, a universalcode signal as described in applicant's patents, etc.)

In some embodiments, a watermark “dial-tone” signal is provided. Thisdial-tone signal is a low amplitude, relatively wideband, repetitivesignal that commonly conveys only limited information (e.g. a single bitof information). Its presence in an audio signal can serve as a “do notrecord,” or similar instruction signal. Alternatively, or in addition,the dial-tone signal can serve as an aid in “locking” to a plural-bitdigital watermark signal that is also encoded in the audio. For example,the cyclical repetition of the signal can serve to identify the start ofthe plural-bit digital watermark signal. Or the spectrum or repetitionrate of the signal can identify any temporal corruption of the audio. Anexemplary such signal is detailed as a “simple universal code” in U.S.Pat. No. 5,636,292.

A track of music can be pre-authorized for specified types of use. Forexample, the usage control string of the watermark payload may include asix-bit field detailing the classes of devices for which the audio isauthorized. Each bit would correspond to a different class of device.Class 1 devices may be personal playback devices with only analog-audiooutput. Class 2 devices may be personal entertainment devices capable ofoutputting music in digital (e.g. MP3, redbook, *.WAV) format, as wellas analog audio. Class 3 devices may be personal computer systems (i.e.with essentially unlimited ability for processing and outputting digitalaudio). Etc., etc. A device to which such MP3 audio is provided wouldcheck the usage control string data to determine whether it isauthorized to utilize the audio. A personal playback device withanalog-only output, for example, would examine the first bit of theusage control string. If it was “1,” the device would be authorized touse (i.e. playback) the MP3 data; if it was a “0,” the device wouldrefuse to play the music.

In addition to pre-authorization for certain classes of devices, theusage control string can also include bits indicating the number ofpermitted playbacks. This data can be encoded in bits seven throughnine, representing eight possibilities:

-   -   0—no playback permitted    -   1—single playback permitted    -   2—two playbacks permitted    -   3—three playbacks permitted    -   4—four playbacks permitted    -   5—five playbacks permitted    -   6—10 playbacks permitted    -   7—unlimited playbacks permitted    -   8—refer to associated data (within the watermark, or stored at a        remote site) which specifies number of permitted playbacks.

The playback device may include a non-volatile store in which the numberof permitted playbacks is stored for each track of music. The devicewould decrement this number at the beginning of each playback.

The usage control string can also include a two-bit field (bits ten andeleven) indicating recording permissions. A value of 0 means that datacorresponding to the MP3 audio (regardless of digital format) shouldnever be made available to another digital device. A value of 1 meansthat the data corresponding to the MP3 data may be made available onceto another digital device. A value of 2 means that the data may be madeavailable an unlimited number of times to other digital devices. (Value3 is reserved.)

Another data field that can be included in an audio watermark is arating that indicates age-appropriateness. Music with violence or sexualthemes might be given a rating akin to the MPAA “PG-13” or “R” rating.Audio appliances may be programmed to recognize the rating of incomingmusic, and to interrupt playback if the rating exceeds a certainthreshold setting. Various known techniques can be employed to assurethat such settings cannot readily be changed, e.g., by juvenilelisteners.

Another data field that can be included in an audio watermark is a datefield. This field can indicate either the date the music waswatermarked, or a date in the future on which certain rights associatedwith the music should change. Some consumers, for example may not wishto purchase perpetual playback rights to certain musical selections. Theright to play a selection for 6 months may suffice for many consumers,especially if the price is discounted in view of the limited term. Suchan arrangement would not be wholly disadvantageous to musicdistributors, since some consumers may end up purchasing music twice iftheir initial assessment of a musical selection's appeal was tooshort-sighted. (Naturally, the playback equipment would require a sourceof real-time clock data against which the date field in the watermarkcan be checked to ensure that the playback rights have not yet expired.)

Another of the data fields that can be included in an audio watermarkspecifies technical playback parameters. For example, the parameter cancause the playback appliance to apply a spectral equalization thatfavors bass frequencies, or treble frequencies, or mid-rangefrequencies, etc. Other pre-configured equalization arrangements cansimilarly be invoked responsive to watermark data. Likewise, theparameter can invoke special-effects provided by the playback appliance,e.g., echo effects, reverb, etc. (Again, such parameters are usuallyrepresented in an abbreviated, coded form, and are interpreted inaccordance with instructions stored in a memory (either in the playbackappliance, or linked thereto).

The same data fields and principles can be applied to non-audio content.In video, for example, watermarked data can adaptively control thedisplay monitor or playback parameters (e.g., color space) to enhancethe viewing experience.

Music Asset Management/Commerce

The majority of domestic music piracy is not organized. Rather, it is acrime of opportunity and convenience. If the crime were made moredifficult, the alternative of obtaining a copy through legitimatechannels would be less onerous. Similarly, if the procedure forobtaining a copy through legitimate channels were simplified, theincentive for piracy would be reduced. Watermarking facilitatesboth—making the crime more difficult, and making legitimate musicacquisition easier.

Consider, for example, the pricing of music in conventional recordstores. A CD (compact disk) may cost $15, but its sale may be driven byjust one or two popular songs on the disk. To obtain these songs, theconsumers must purchase the entire disk, with perhaps a dozen songs ofno particular interest. This, in essence, is a tying arrangement thatbenefits the record labels while prejudicing the consumers. Given thesecircumstances, and a ready opportunity to make copies, it is notsurprising that customers sometimes make illicit copies.

One classic technique of avoiding purchase of a complete collection ofmusic, when only one or two songs is desired, is to record the music offthe radio. While of dubious legality, this technique was popular in theera of combined cassette/radio players. However, the desired music wassometimes difficult to encounter in a radio broadcast, and the qualitywas less than superb.

The combined cassette/radio player has now evolved into a generalpurpose computer with wide-ranging functionality, and othersophisticated devices. Music can be acquired off the web, and can berecorded in various forms (e.g. in a personal MP3 player, stored on ahard disk, stored on a writeable CD-ROM, played back and recorded onanalog cassette, etc., etc.). The quality can be quite high, and theerratic broadcast time problems of radio broadcasts have been overcomeby the web's on-demand delivery mechanisms. (Moreover, the music can bedownloaded in faster-than-realtime, a further benefit overrecording-off-the-air techniques)

One hybrid between the new and old is a novel radio (e.g., for use in acar) that has a “capture” button on the front panel (or other form ofuser interface, e.g., a Capture icon on a GUI). If a user hears a songthey want to record and keep, they press the Capture button while thesong is playing. In response, the radio device decodes a watermarkembedded in the music, and thereby knows the identity of the music. Theradio then makes a wireless transmission identifying the user and thedesired song. A local repeater network picks up the wireless signal andrelays it (e.g. by wireless rebroadcast, by modem, or othercommunication medium) to a music clearinghouse. The clearinghousecharges the user a nominal fee (e.g. via a pre-arranged credit card),and queues the music for download to a predetermined location associatedwith the user.

In one embodiment, the predetermined location is the user's owncomputer. If a “live” IP address is known for the user's computer, themusic can be transferred immediately. If the user's computer is onlyoccasionally connected to the internet, the music can be stored at a website (e.g. protected with a user-set password), and can be downloaded tothe user's computer whenever it is convenient.

In other embodiments, the predetermined location is a personal musiclibrary maintained by the user. The library can take the form, e.g., ofa hard-disk or semiconductor memory array in which the user customarilystores music. This storage device is adapted to provide music data toone or more playback units employed by the user (e.g. a personal MP3player, a home stereo system, a car stereo system, etc.). In mostinstallations, the library is physically located at the user'sresidence, but could be remotely sited, e.g. consolidated with the musiclibraries of many other users at a central location.

The personal music library can have its own internet connection. Or itcan be equipped with wireless capabilities, permitting it to receivedigital music from wireless broadcasts (e.g. from the clearinghouse). Ineither case, the library can provide music to the user's playbackdevices by short-range wireless broadcast.

By such arrangement, a user can conveniently compile an archive offavorite music—even while away from home.

Many variants of the foregoing are of course possible. The radio can bea portable unit (e.g. a boombox, a Walkman radio, etc.), rather than anautomotive unit. The UI feature employed by the user to initiate capturea musical selection need not be a button (physical or on-screen). Forexample, in some embodiments it can be a voice-recognition system thatresponds to spoken commands, such as “capture” or “record.” Or it can bea form of gesture interface.

Instead of decoding the watermark only in response to the user's“capture” command, the radio can decode watermarks from all receivedprograms, and keep the most recent in a small FIFO memory. By sucharrangement, the user need not issue the capture instruction while thesong is playing, but can do so even after the song is finished.

In some embodiments, data corresponding to the watermark can be madeavailable to the user in various forms. For example, it can be presentedto the user on an LCD screen, identifying the artist and song currentlyplaying. If a corresponding UI button is activated, the device canso-identify the last several selections. Moreover, the data need not bepresented to the user in displayed form; it can be annunciated by knowncomputer-speech technologies instead.

In embodiments in which the watermark does not convey ASCII text data,but instead conveys UIDs, or coded abbreviations, the device mustgenerally interpret this data before presenting it to the user. In anillustrative embodiment, the device is a pocket-sized FM radio and isequipped with a 1 megabyte semiconductor non-volative RAM memory. Thememory includes a data structure that serves as a look-up table,matching code numbers to artist names and song titles. When the userqueries the device to learn the identify of a song, the memory isindexed in accordance with one or more fields from the decodedwatermark, and the resulting textual data from the memory (e.g. songtitle and artist) is annunciated or displayed to the user.

In most applications, such memory will require frequent updating. The RFreceiver provides a ready mechanism for providing such updated data. Inone embodiment, the radio “awakens” briefly at otherwise idle momentsand tunes to a predetermined frequency at which updated data for thememory is broadcast, either in a baseband broadcast channel, or in anancillary (e.g. SCA) channel.

In variants of the foregoing, internet delivery of updated memory datacan be substituted for wireless delivery. For example, the artist/songtitle memory in the personal player can be updated by placing the playerin a “nest” every evening. The nest (which may be integrated with abattery charger for the appliance) can have an internet connection, andcan exchange data with the personal device by infrared, inductive, orother proximity-coupling technologies, or through metal contacts. Eachevening, the nest can receive an updated collection of artists/songtitles, and can re-write the memory in the personal device accordingly.By such arrangement, the watermark data can always be properlyintepreted for presentation to the user.

The “Capture” concepts noted above can be extended to other functions aswell. One is akin to forwarding of email. If a consumer hears a songthat another friend would enjoy, the listener can send a copy of thesong to the friend. This instruction can be issued by pressing a “Send”button, or by invoking a similar function on a graphical (or voice- orgesture-responsive) user interface. In response, the applianceso-instructed can query the person as to the recipient. The person candesignate the desired recipient(s) by typing in a name, or a portionthereof sufficient to uniquely identify the recipient. Or moretypically, the person can speak the recipient's name. As is conventionalwith hands-free vehicle cell phones, a voice recognition unit can listento the spoken instructions and identify the desired recipient. An“address book”-like feature has the requisite information for therecipient (e.g., the web site, IP address, or other data identifying thelocation to which music for that recipient should stored or queued, theformat in which the music should be delivered, etc.) stored therein. Inresponse to such command, the appliance dispatches instructions to theclearinghouse, including an authorization to debit the sender's creditcard for the music charge. Again, the clearinghouse attends to deliveryof the music in a desired manner to the specified recipient.

Still further, a listener may query the appliance (by voice, GUI orphysical button, textual, gesture, or other input) to identify CDs onwhich the then-playing selection is recorded. Or the listener may querythe appliance for the then-playing artist's concert schedule. Again, theappliance can contact a remote database, relay the query, and forwarddata from the watermark payload identifying the artist and/or song titleto which the query relates. The database locates the requested data, andrelays same back to the appliance for presentation (via a display, bymachine speech, or other output) to the user. If desired, the user cancontinue the dialog with a further instruction, e.g., to buy one of theCDs on which the then-playing song is included. Again, this instructionmay be entered by voice, GUI, etc., and dispatched from the applicanceto the clearinghouse, which can then complete the transaction inaccordance with pre-stored information (e.g. credit card account number,mailing address, etc.). A confirming message is relayed to the appliancefor presentation to the user.

While the foregoing transactions require a link to a remote site ordatabase, other watermark-based consumer services can be providedwithout such a link. For example, a user can query the appliance as tothe artist or song-title of the selection currently playing. Theappliance can consult the embedded watermark data (and optionallyconsult a memory to determine the textual names associated with codedwatermark data), and provide the requested information to the user(e.g., by a display, annunciation, or other output).

The foregoing concepts (e.g. Capture, Send, etc.) can also be employedin connection with internet-rather than radio-delivery of music. (Thefollowing discussion is illustrated with reference to the “Capture”function, but it will be recognized that the other earlier-discussedfeatures can be similarly implemented.)

There are many commercial web sites that sell audio (in CD form orotherwise), and offer limited free music downloads, (or music clips) asan enticement to lure consumers. But there are also a great number ofmusic web sites that have no commercial pretense. They are hosted bymusic lovers strictly for the enjoyment of other music lovers. Whenmusic is downloaded from such a web site, the end-user's computer cananalyze the digital data to decode watermark data therefrom. Again, theuser can be presented with a “Capture” button that initiates acommercial transaction, by which a complete copy of the then-downloadedaudio is sent to a prearranged storage location, and the user's creditcard is debited accordingly. This transaction can occur independently ofthe site from which the music is downloaded (e.g. through theclearinghouse referenced above).

While the “Capture” button can be presented on the web-site, this wouldgenerally not be in keeping with the non-commercial nature of such websites. Instead, in an exemplary embodiment, the Capture feature is asoftware program resident at the user's computer. When this softwareprogram is invoked by the user, a socket channel is instantiated betweenthe user's computer and the clearinghouse over the then-existinginternet connection. The decoded watermark data and user ID istransmitted to the clearinghouse over this channel, without interruptingthe user's other activity (e.g. downloading music from thenon-commercial web site). In response, the clearinghouse transmits themusic to the prearranged location and attends to billing.

In some embodiments, a watermark detector is included as part of theoperating system, and constantly monitors all TCP/IP, or other internet,data received by the user's computer, for the presence of watermarks. Insuch case, when the Capture feature is invoked, the program examines amemory location in which the operating system stores the most-recentlyreceived watermark data. In another embodiment, the computer does notmonitor all internet traffic for embedded watermark data, but includesan API that can be called by the Capture program to decode a watermarkfrom the data then being received. The API returns the decoded watermarkdata to the Capture program, which relays same to the clearinghouse, asabove. In still another embodiment, the watermark decoder forms part ofthe Capture program, which both decodes the watermark and relays it tothe clearinghouse when the Capture program is invoked by the user.

There are various techniques by which the Capture program can beselectively invoked. One is by a keyboard macro (e.g. by a combinationof keyboard keys). Another is by a program icon that is always presentedon the screen, and can be double-clicked to activate. (Again,confirmation processes may be called for, depending on the likelihood ofinadvertent invocation.) Many other techniques are likewise possible.

In the just-contemplated scenario, the Capture operation is invokedwhile the user is downloading music from a non-commercial web site. Thisseems somewhat redundant, since the downloading—itself—is transferringmusic to the user's computer. However, the Capture operation providesadded value.

In the case of streaming audio, the audio is not typically stored in alocation in which it can be re-used by the consumer. It can belistened-to as delivered, but is then gone. Capturing the audio providesthe user a copy that can be played repeatedly.

In the case of downloaded music files, the music may have been encodedto prevent its recordal on other devices. Thus, while the user maydownload the music onto a desktop computer, copy-prevention mechanismsmay prevent use of that file anywhere else, e.g. on a portable musicappliance. Again, Capturing the audio provides the user a copy that canbe transferred to another device. (The music file provided by theclearinghouse can have copy-prevention limits of its own—e.g., the filecan be copied, but only once, or the file can be copied only ontodevices owned by the user.)

(Confirmation of device ownership can be implemented in various ways.One is to identify to the clearinghouse all music devices owned by auser at the time the user registers with the clearinghouse (supplementedas necessary by later equipment acquisitions). Device IDs associatedwith a user can be stored in a database at the clearinghouse, and thesecan be encoded into the downloaded music as permitted devices to whichthe file can be copied, or on which it can be played.)

The commerce opportunity presented by non-commercial music.web-sites isbut one enabled by digital watermarks. There are many others.

To take one example, consider the media by which music and artists arepresently promoted. In addition to radio airtime, these include musicvideos (a la MTV), fan magazines, web advertisements, graphical icons(e.g. the Grateful Dead dancing bears), posters, live events, movies,etc. Watermarked data can be used in all such media as a link in acommercial transaction.

A poster, for example, typically includes a photo of the artist, and maycomprise cover-art from a CD. The photo/art can be digitally watermarkedwith various types of data, e.g., the artist's identify, the recordlabel that distributes the artist's work, the music project beingparticularly promoted by the poster (e.g. a CD, or a concert tour), afan web-site related to the artist; a web-site hosted by the recordlabel for selling audio in CD or electronic form, a web-site from whichfree music by the artist can be downloaded, data identifying the posteritself, etc.

A user, equipped with a portable appliance that merges the functions ofpalmtop computer and digital camera, can snap an image of the poster.The processor can decode the watermarked data, and initiate any ofvarious links based on the decoded data.

In an exemplary embodiment, after snapping the picture, the user invokesa software program on the device that exposes the various links gleanedfrom the snapped image data. Such a program can, for example, presentthe option of linking to the artist's fan web site, or downloading freestreaming audio or music clips, or ordering the promoted CD, orrequesting the above-noted clearinghouse to download a personal copy ofselected song(s) by the artist to the user's personal music library,etc. (The device is presumed to have a wireless internet link. Indevices not having this capability, the requested actions can be queuedand automatically executed when a link to the internet is available.)

Still more complex transactions can be realized with the use of a remotedatabase indexed by digital watermark fields decoded from the poster.For example, the poster may promote a concert tour. Fields of thedigital watermark may identify the artist (by a code or full text), anda web site or IP address. The user appliance establishes a link to thespecified site, and provides the artist identifier. In response, thesite downloads the tour schedule for that artist, for display on thedevice. Additionally, the downloaded/displayed information can include atelephone number that can be used to order tickets or, more directly,can indicate the class of seats still available at each (or a selected)venue, and solicit a ticket order from the user over the device. Theuser can supply requested information (e.g. mailing address and chargecard number) over the return channel link (wireless or wired, as thecase may be), and the ticket(s) will be dispatched to the user. In thecase of a wireless link, all of this can occur while the user isstanding in front of the movie poster.

Similar systems can be implemented based on watermark data encoded inany other promotional media. Consider music videos. Using knownTV/computer appliances, watermark data added to such videos can readilybe decoded, and used to establish links to audio download, CD-sales, fanclub, concert ticket outlet web sites, etc., as above.

Even live events offer such watermark-based opportunities. The analogaudio fed to public address or concert speakers can be watermarked(typically before amplification) to encode plural-bit digital datatherein. A next generation personal music appliance (e.g. one with awireless interface to the internet) can include analog record capability(e.g. a built-in microphone, analog-to-digital converter, MP3 encoder,coupled to the unit's semiconductor memory). A user who attends a liveevent may record an excerpt of the music. The watermark can then bedecoded, and the extracted data used to access the links and commerceopportunities reviewed above.

Cinema movies offer both audio and visual opportunities forwatermark-based commerce opportunities. Either medium can be encoded toconvey information of the types reviewed above. A personal appliancewith image- or audio-capture capabilities can capture an excerpt of theaudio or imagery, decode the watermark data therefrom, and perform anyof the linking, etc., functions reviewed above.

The consumer-interest watermarks reviewed above are only exemplary. Manyothers will be recognized as useful. For example, promotional clipspresented before a feature film presentation can include watermark datathat point (either by a literally encoded web address link, or by an IDcode that indexes a literal link in a remote link database) to reviewercritiques of the previewed movies. Watermark data in a featured filmpresentation can lead to web sites with information about the moviestars, the director, the producer, and can list other movies by each ofthese persons. Other watermark-conveyed web links can presentopportunities to buy the movie on videotape, to purchase the moviesoundtrack, to buy movie-related toys and games, etc.

More on Device Control

Much of the foregoing has focused on watermark encoding to provideenhanced customer experiences or opportunities. Naturally, watermarksdata can alternatively, or additionally, serve the interests of themedia owner.

To illustrate, consider watermarked music. The media owner would be bestserved if the watermark serves dual purposes: permissive andrestrictive. Permissively, music appliances can be designed to play (orrecord) only music that includes an embedded watermark signaling thatsuch activity is authorized. By this arrangement, if music is obtainedfrom an unauthorized source and does not include the necessarywatermark, the appliance will recognize that it does not have permissionto use the music, so will refuse requests to play (or record).

As noted, music appliances can respond restrictively to the embeddedwatermark data to set limits on use of the music. Fields in thewatermark can specify any or all of (or others in addition to) (a) thetypes of devices on which the music can be played (b) the types ofdevices on which the music can be recorded; (c) the number of times themusic can be played; (d) the number of times the music can be recorded,etc.

The device restrictions (a) and (b) can be of various types. In someembodiments, the restrictions can identify particular units (e.g. byserial number, registered owner, etc.) that are authorized toplay/record the encoded music. Or the restrictions can identifyparticular classes of units (e.g., battery-powered portable players withmusic memories of less than 50 megabytes, disk-based dedicated musicappliances, general purpose personal computers, etc.) Or therestrictions can identify particular performance quality criteria (e.g.,two channel, 16-bit audio at 44.1 KHz sample rate, or lower quality).

The use restrictions (c) and (d) can likewise be of various types.Examples include “do not copy,” “copy once only,” “unrestricted copyingpermitted,” “play once,” “play N times” (where N is a parameterspecified elsewhere in the watermarked data, or by reference to adatabase indexed by a watermark data field), “unrestricted playingpermitted,” etc.

It is straightforward to design a music appliance to respond to usagelimits of zero (e.g. “do not copy”) and infinity (e.g. “unrestrictedcopying permitted,” and “unrestricted playing permitted”). The devicesimply examines one or more bits in the watermark data, and permits (orrefuses) an operation based on the value thereof.

Implementation of the other usage-control restrictions can proceed invarious ways. Generally speaking, the stored music can be altered togive effect to the usage-control restrictions. For example, if the musicis “record-once,” then at the time of recording, the appliance can alterthe music in a fashion indicating that it now has “do not record”status. This alteration can be done, e.g., by changing the watermarkdata embedded in the stored music (or adding watermark data), bychanging other data stored in association with the music, etc. If theoriginal signal is stored (as opposed, e.g., to a streaming signal, suchas an internet or wireless transmssion), it too should be so-altered.

Likewise with playback limitations. The number of playbacks remainingcan, e.g., be encoded in an updated watermark in the music, be trackedin a separate counter, etc.

More particularly considering the “copy once” usage restriction, anillustrative embodiment provides two distinct watermark payload bits: a“copy once” bit and a “copy never” bit. When originally distributed(whether by internet, wireless, or otherwise), the copy once” bit isset, and the “copy never” bit is un-set.

When music encoded in this fashion is provided to a compliant recordingdevice, the device is authorized to make one copy. (A compliant deviceis one that recognizes encoded watermark data, and behaves as dictatedby the watermark.) When this privilege is exercised, the recordingdevice must alter the data to ensure that no further copying ispossible. In the illustrated embodiment, this alteration is effected bythe recording device adding a second watermark to both the music, withthe “copy never” bit asserted. The second watermark must generally beencoded in an “orthogonal” domain, so that it will be detectablenotwithstanding the continued presence of the original watermark.Compliant equipment must then check for both watermarks, and refuse tocopy if either is found to have the “copy never” bit asserted.

One advantage to this arrangement is that if the watermark signal hasundergone some form of corruption (e.g. scaling or resampling), thefirst watermark may have been weakened. In contrast, the secondwatermark will be native to the corrupted signal, and thus be moreeasily detected. (The corruption may also contribute to theorthogonality of one watermark relative to the other, since the twowatermarks may not have precisely the same time base or otherfoundation.)

An alternative approach is not to encode the “copy never” bit in theoriginal music, but leave this bit (in whatever manifestation) blank(i.e. neither “1” nor “0”). In transform-based watermark techniques,this can mean leaving transform coefficient(s) corresponding to the“copy never” bit un-changed. If the watermarking is effected in thetemporal sample domain (or spatial domain, for image data), this canmean leaving certain samples (pixels) unmodified. The recording devicecan then alter the transform coefficients and/or samples as necessary toassert the previously-unencoded “copy never” bit when the permittedrecording is made.

In such a system, compliant recording devices check for the “copy never”bit in the sole watermark, and refuse to make a copy if it is asserted(ignoring the value of any “copy once” bit).

A third approach to “copy once” is to set both the “copy once” and “copynever” bits, but set the former bit very weakly (e.g. using lower gainand/or high frequency DCT coefficients that do not survive certainprocessing). The frail “copy once” bit is designed not to survive commoncorruptions, e.g., resampling scaling, digital to analog conversion,etc. To further assure that the “copy once” bit is lost, the recordingdevice can deliberately add a weak noise signal that masks this bit(e.g. by adding a noise signal in the frequency band whose DCTcoefficient conveys the “copy once” bit). In contrast, the “never copy”bit is unchanged and reliably detectable.

In such a system, compliant devices check for the “copy once” bit in thesole watermark, and refuse to make a copy if it is not detected as set.

These three examples are but illustrations of many possible techniquesfor changing the rights associated with a work. Many other techniquesare known. See, e.g., the proposals for watermark-based copy controlsystems for digital video at the Copy Protection Technical WorkingGroup, http://www.dvcc.com/dhsg/, from which certain of the foregoingexamples are drawn. See also Bloom et al, “Copy Protection for DVDVideo,” IEEE Proceedings, Special Issue on Identification and Protectionof Multimedia Information, June, 1999.

Scaleability

One feature that is desirable in many detector embodiments isscaleability. This refers to the ability of a detector to scale itscomputational demands to match the computational resources available toit. If a detector is running on a high performance Pentium IIIworkstation, it should be “doing more,” than if the same detector isrunning on a slow microcontroller. One way scalability can be achievedis by processing more or less chunks of input data (e.g. temporalexcerpts of music, or blocks/macroblocks of pixels in a frame of videodata) to decode watermarks. For example, an input audio stream might bebroken into chunks of one second each. A fast processor may completedecoding of each chunk in less than a second, permitting it successivelyto process each chunk in the data stream. In contrast, a slow processormay require two and a half seconds to decode the watermark from a chunk.While it is processing a first chunk, the second and third pass byun-decoded. The processor next grabs and processes the fourth chunk,permitting the fifth and sixth to pass by un-encoded.

The detector running on the fast processor is clearly more difficult to“fool,” and yields a decoded watermark of higher confidence. But bothsystems decode the watermark, and both operate in “real time.”

The skipping of input data in the temporal (e.g. music or video) orspatial (e.g. image or video) domain is but one example of howscaleability can be achieved. Many other approaches are known to thoseskilled in the art. Some of these alternatives rely on spending more orless time in the data analysis phases of watermark decoding, such ascross-correlation operations.

Reference has been made to watermarked UIDs as referring to a databasefrom which larger data strings (e.g. web addresses, musician names,etc.) can be retrieved. In some embodiments, the data record referencedby a UID can, in turn, point to several other database records. By sucharrangements, it is often possible to reduce the payload of thewatermark, since a single UID reference can lead to several differentdata records.

Production Tools

In the prior art, the watermark embedded in a source material istypically consistent and static through a work—unchanging from beginningto end. But as will be recognized from the foregoing, there are manyapplications that are better served by changing the watermark datadynamically during the course of the work. According to another aspectof the invention, a production tool is provided that facilitates theselection and embedding of dynamically-changing watermark data. One suchembodiment is a software program having a user interface thatgraphically displays the different watermark fields that are beingembedded in a work, and presents a library of data (textually or byicons) that can be inserted into each field, and/or permits the user totype in data to be encoded. Another control on the UI controls theadvance and rewind of the media, permitting the user to determine thelocation at which different watermark data begins and ends. Graphicalparadigms known from video- and audio-editing tools can be used toindicate the starting and ending frames/samples for each differentwatermark payload.

Such a tool can be of the standalone variety, or can be integrated intothe desktop audio- and video-production and editing tools offered byvendors such as Avid, Adobe, Jaleo, Pinnacle Systems, SoundForge, SonicFoundry, Xing Technology, Prosoniq, and Sonic Desktop Software.

Payment-Based Systems

Another aspect of the present invention is the use of anonymous paymenttokens that can be used to obtain content on the web. In one embodiment,a token comprises a 128-bit pseudo-random number, to which additionalbits identifying an issuing bank (or other issuing institution) areappended. (The additional bits can be the IP address of a web server ofthe bank, a routing number identifying the bank for electronic wiretransfers, or other identifier.) The 128-bit numbers are randomlygenerated by the bank—commonly as needed—and each represents a fixedincrement of money, e.g. ten cents.

A consumer wishing to have a store of currency for such commerce paysthe bank, e.g., $10 in exchange for 100 tokens. These tokens aretransferred electronically to disk or other storage in the consumer'scomputer in response, e.g., to a credit card authorization, or may beprovided by diskette or other storage medium over the counter at a bankbranch (in which case the consumer thereafter copies the numbers intostorage of his or her computer). (Outlets other than banks can of coursebe employed for distributing such numbers, much in the manner thatconvenience and many grocery stores commonly issue money orders.)

Imagine that the consumer wishes to view the final quarter of aTrailblazer basketball game that aired on television a week ago. (Theconsumer may have either missed the game, or may have seen it but wantsto see the last quarter again) The user directs a web browser to a website maintained for such purpose and performs a search to identify thedesired program. (Typically, the web site is maintained by theproprietor that holds the copyright in the material, but this need notbe the case. Some material may be available at several web sites, e.g.,maintained by ABC Sports, the National Basketball Association, andSports Illustrated.) The search can use any of various known searchengines, e.g., Infoseek, Verity, etc., and can permit searching by titleterms, keywords, date of airing, copyright owner, etc. By typing in,e.g., the keyword ‘Trailblazers’ and the date ‘4/26/99,’ the consumer ispresented a listing of videos available for download. One, hopefully, isthe requested game. With each listing is an indication of an associatednominal charge (e.g. 80 cents).

On clicking on a hypertext link associated with the desired basketballgame, the viewer is presented a further screen with one or more options.The first of the listed options is the entire game, with commercials.The charge is the nominal charge presented on the earlier screen (i.e.80 cents). Other options may include the first, second, third, andfourth quarters of the game individually, each of which—save the last,costs 20 cents. The last may be charged at a premium rate, e.g., 30cents. Clicking on the desired video option yields a further screenthrough which payment is effected.

To pay for the requested video, the consumer instructs his or hercomputer to transfer three of the earlier-purchased tokens over the webto the video provider. Various user interface metaphors can be employedto facilitate this transfer, e.g., permitting the user to type theamount of money to be transferred in a dialog box presented on-screen,or dropping/dragging icons representing tokens from an on-screen“wallet” to an on-screen “ticket booth” (or over an icon or thumbnailrepresenting the desired content), clicking on an “increment” counterdisplayed adjacent the listing of the content, etc. Once the consumerhas authorized a transfer of sufficient tokens, the consumer's computersends to the web site (or to such other web address as HTML encoding inthe viewed web page may indicate) the tokens. This transmission simplytakes the form of the three 128+bit numbers (the ‘+’ indicating the bankidentifier)—in whatever packet or other format may be used by theinternet link. Once dispatched in this manner, the tokens are deletedfrom the user's computer, or simply marked as spent. (Of course, inother embodiments, a record of the expenditure may be stored in theconsumer's computer, e.g., with the token contents and a record of theaudio or video purchase to which they were applied.)

Since the amount of money is nominal, no encryption is provided in thisembodiment, although encryption can naturally be provided in otherembodiments (e.g., either in sending the tokens from the user to the website, or earlier, in sending the tokens to the user). As will be seen,provided that the media provider immediately sends the tokens to thebank in real time, encryption is a nice feature but not mandatory

On receipt of the token data, the web site immediately routes the tokendata to the identified bank, together with an identifier of the mediaprovider or account to which the funds represented thereby are to becredited. The bank checks whether the 128-bit numbers have been issuedby that bank, and whether they have already been spent. If the numbersare valid, the bank updates its disk-based records to indicate that thethree tokens have been spent and that the bank now owes the mediasupplier 30 cents, which it may either pay immediately (e.g., bycrediting to an account identified by the media provider) or as one lumpsum at the end of the month. The bank then sends a message to the website confirming that the tokens were valid and credited to the requestedaccount. (Optionally, a message can be sent to the purchaser of thetokens (if known), reporting that the tokens have been redeemed.)

In response, the web site begins delivery of the requested video to theconsumer. In the illustrated embodiment, the video is watermarked priorto delivery, but otherwise sent in unencrypted fashion, typically instreaming format, but optionally in file format. (Encryption can be usedin other embodiments.) The watermarking in the illustrated embodiment isaccomplished on-the-fly and can include various data, including the dateof downloading, the download site, the destination IP address, theidentity of the purchaser (if known), etc.

The large size of the video and the small charge assessed thereforprovide disincentives for the consumer making illicit copies.(Especially as to archival material whose value decays with time, thereis not much after-market demand that could be served by illicit copies,making third party compilation of such material for re-distributionfinancially unattractive. First run video, and material that keeps ahigh value over time, would not be as well suited for such distribution,and could better employ technology disclosed elsewhere herein.)

In some embodiments, the integrity of the received video is checked onreceipt. This feature is described below in the section entitledWatermark-Based Receipts.

In the illustrative system, nothing in the tokens indicates the identityof the purchaser. The web site knows the IP address of the site to whichvideo was delivered, but need not otherwise know the identity of thepurchaser. The bank would probably maintain a record of who purchasedthe tokens, but need not. In any event, such tokens could thereafter beexchanged among consumers, resulting in anonymity from the bank, ifdesired.

As described above, the video excerpts from which the consumer canselect include commercials. At some sites, video may be provided fromwhich the commercials have been excised, or which is delivered in amanner that skips past the commercials without transmitting same to theconsumer. Such video will naturally command a premium price. In someembodiments, the difference in price is electronically credited ascompensation to accounts maintained for (or by) the advertisers, whoseadvertisements are not being viewed by such consumers. (Theidentification of advertisers to be credited is desirably permanentlyencoded in the video, either throughout the video (if the video has hadthe commercials removed therefrom), or by data in the commercialsthemselves (which commercials are skipped for transmission to theconsumer, but can still be decoded at the video head-end. Such encodingcan be by in-band watermarking or otherwise.)

While the foregoing discussion particularly considered video as thedesired content, the same principles are equally applicable inconnection with audio, still imagery, and other content.

The token-based payment method is but one of many that can be employed;the literature relating to on-line payment mechanisms is extensive, andall such systems can generally be here-employed.

Tracking 128-bit tokens can be a logistical problem for the bank. Oneapproach is to have a memory with 10¹²⁸ locations, and at each locationstore a two-bit value (e.g. 00=never issued; 01=issued but not spent;10=issued and spent; 11=reserved). More complete data couldalternatively be stored, but such a memory would be impractically large.

One alternative approach is to hash each 128-bit number, when issued, toa much smaller key value (e.g. 20 bits). A memory with 10²⁰ locationscan be indexed by this key. Each such location can include four data: anissued 128-bit token number that hashes to that value, first and seconddate fields indicating the date/time on which that token was issued andredeemed, respectively, and a link specifying the address of a nextmemory location. That next memory location (outside of the original 10²⁰locations) can include four more data, this time for a secondissued-128-bit token number that hashed to the original key value, twodate fields, and again with a link to a subsequent storage location,etc.

When a 128-bit random number is generated, the original memory locationindexed by the hash code of that number is checked for an earlier numberof the identical value (to avoid issuance of duplicate tokens). Eachsuccessive location in the linked chain of memory locations is checkedfor the same 128-bit number. When the end of the linked chain isreached, the bank knows that the 128-bit random number has notpreviously been issued, and writes that number in the last-addressedlocation, together with the date of issuance, and a link to a nextstorage location.

When a 128-bit token is received, the same linked-list processing occursto identify a first location, and to thereafter step through eachsubsequent location until a match is found between the token number andthe number stored in one of the linked memory locations. When found,that number is marked as redeemed by writing a redemption date/time inthe corresponding field. If the search reaches the end of the linkedchain without finding a match between the stored numbers and the tokennumber, the token is treated as invalid (i.e. not issued by that bank).

Other manners of tracking the large number of possible token numbers canof course be used; the foregoing is just exemplary. Or the tokensneedn't be tracked at all. Such an arrangement is highly practical ifthe token has sufficient bits. With the illustrated 128 bits, forexample, the chance of two identical tokens being issued isinfinitesimally small, so checking for duplicate issuance can be omittedif desired. In such case, the bank can simply maintain an ordered listof the token numbers still outstanding and valid. As new tokens aredispensed, their token numbers are added to the list. As tokens areredeemed, their numbers are deleted from the list. Known list processingtechniques can be employed to speed such search, update, and deleteactions.

Watermark-Based Receipts

Pay-for-content applications commonly assume that if content istransmitted from a server (or head-end, etc.), it is necessarilyreceived. Sometimes this assumption is wrong. Network outages andinterruptions and internet traffic load can diminish (e.g., droppedvideo frames), or even negate (e.g., failed delivery), expected consumerenjoyment of content. In such cases, the consumer is left to haggle withthe content provider in order to obtain an adjustment, or refund, ofassessed charges.

Watermarks provide a mechanism for confirming receipt of content. If awatermark is detected continuously during a download or other deliveryevent, a software program (or hardware device) can issue an electronicreceipt attesting that the content was properly delivered. This receiptcan be stored, and/or sent to the content distributor to confirmdelivery.

In one embodiment, a content receiving device (e.g., computer,television or set-top box, audio appliance, etc.) periodically decodes awatermark from the received content to confirm its continued reception.For example, every five seconds a watermark detector can decode thewatermark and make a record of the decoded data (or simply record thefact of continued detection of the same watermark). When a changedwatermark is detected (i.e., reception of a different content objectbegins), the duration of the previously-received content is logged, anda receipt is issued.

In a related embodiment, the last portion (e.g., 5 seconds, frame, etc.)of the content bears a different “end of content” watermark thattriggers issuance of a receipt. Such a watermark can indicate the lengthof the content, to serve as a cross-check against the periodic watermarkpolling. (E.g., if periodic sampling at 2 second intervals yields 545samples corresponding to the same content, and if the “end of content”watermark indicates that the content was 1090 seconds long, then receiptof the entire content can be confirmed.)

In another embodiment, the watermark can change during the course of thecontent by including, e.g., a datum that increments every frame or otherincrement of time (e.g., frame number, time stamp, etc.). A watermarkdetector can monitor the continued incrementing of this datum throughoutthe content to confirm that no part was garbled (which would destroy thewatermark) or was otherwise missing. Again, at the end of delivery, thereceiving system can issue a confirmation that XXX frames/seconds/etc.of the identified content were received.

One application of such technology is to bill for content based onreceipt, rather than transmission. Moreover, billings can be adjustedbased on percentage of content-value received. If delivery isinterrupted mid-way through (e.g., by the consumer disabling thecontent-receiving device), the nominal billing for the content can behalved. Some prolonged content, e.g., televised/web-broadcast universityclasses, cannot be “consumed” in one session, and are thus particularlywell suited for such pay-as-you-consume billing.

Another application of such technology is in advertising verification.Presently, ads are tracked by transmission or, less frequently, bydetection of an embedded code on receipt (c.f., Nielsen Media Research'sU.S. Pat. Nos. 5,850,249 and 5,737,025). However, suchreception-detectors—once triggered—generally do not further note thelength of time that the advertising was received, so the same data isproduced regardless of whether only five or fifty seconds of acommercial is presented. Watermark monitoring as contemplated hereinallows the duration of the advertising impression to be preciselytracked.

In one application of this technology, recipients of advertising areprovided incentives for viewing advertising in its entirety. Forexample, a content-receiving device can include a watermark detectorthat issues a receipt for each advertisement that is heard/viewed in itsentirety. These receipts can be redeemed, e.g., for content tokens asdescribed elsewhere herein, for monetary value, etc. In someembodiments, receipts are generic and can all be applied to a desiredpremium, regardless of the advertisements through which they wereearned. In other embodiments, the receipts are associated with theparticular advertisers (or class of advertisers). Thus, a TV viewer whoaccumulates 50 receipts from advertising originating from Procter &Gamble may be able to redeem same for a coupon good for $2.50 off anyProcter & Gamble product, or receipts from Delta Airlines may beredeemed for Delta frequency flier miles (e.g., at a rate of one mileper minute of advertising). Such incentives are particularly useful innew forms of media that give the consumer enhanced opportunities tofast-forward or otherwise skip advertising.

(Although the foregoing “receipt” concept has been described inconjunction with watermark data (and use of watermark technology isbelieved to be inherently advantageous in this application), the sameprinciples can likewise be implemented with ancillary data conveyed byother means.)

Master Global Address

As suggested above, it is desirable that each piece of content have aweb address (the “Master Global Address” (MGA), or “Master IP Address”)associated with it. Such address is typically conveyed with the content,e.g., by an IP address watermarked therein.

Consider a consumer who downloads a streaming video having an Englishlanguage soundtrack. The viewer may not speak English, or may otherwiseprefer to listen to the soundtrack in another language. The user candecode the watermark data embedded in the video and initiate a link tothe associated web address. There the user is presented with a list ofsoundtracks for that content object in other languages. The viewer canclick on the desired language and receive same via a second simultaneoustransmission (e.g., a second socket channel). The consumer's audio/videoappliance can substitute the desired audio track for the default Englishtrack.

If the streaming video and the alternative soundtrack are hosted on thesame server, synchronization is straightforward. The process governingtransmission of the alternative soundtrack identifies the process thatis streaming video to the same IP address. Based on SMPTE, or othertime/frame data, the former process syncs to the latter. (If the twodata streams don't originate through the same server, time/frame datacan be relayed as necessary to the alternative soundtrack server toeffect synchronization.)

Another application of the Master Global Address is to serve as a pointto which monitoring stations can report the presence, or passage, ofcontent. Consider, for example, a copyright-aware node through whichcontent signals pass, e.g., a computer node on a network, a satellitetransponder, etc. Whenever the node detects passage of a media object(e.g., by reference to a file extension, such as MP3, JPG, AVI, etc.),it sends a “ping” over the internet to the address encoded in theobject, simply reporting passage of the object. Similar monitoringfacilities can be provided in end user computers, e.g., reportingFileOpen, FileSave, Printing, or other use of content bearing MGA data.

This system can be expanded to include “ping” and “pong” phases ofoperation. When a software application (or a user appliance, such as avideo or audio playback device) encounters a media object (e.g., at timeof file open, at time of playback, etc.), it pings the MGA site toreport the encounter. The MGA site “pongs” back, responding withinstructions appropriate to the encounter. For example, if the objectrequires payment of a fee before full functionality or access is to begranted, the MGA site can respond to the application with instructionsthat the object be used (e.g., played back) only in some crippled statepreventing the user's full enjoyment (e.g., impaired resolution, orimpaired sound quality, or excerpts only, etc.). The MGA site can alsoinform the user application of the terms (e.g., payment) by which fullfunctionality can be obtained. The application can graphically oraudibly present such information to the user, who can authorize apayment, if desired, so that the content can be enjoyed in a less- (orun-)crippled state. On receipt of the payment authorization, the MGAsite can inform the user application that enhanced access/usage rightshave been purchased, and that the application may proceed accordingly.

Yet another application of the MGA is to present the user of a contentobject a menu of options that is customized to that object.

In current graphical operating systems, when a user clicks on an icon(e.g., with the right mouse button), a menu is presented detailingactions that can be undertaken in connection with the icon, or the filerepresented thereby. Such options are pre-programmed (i.e., static), andare typically determined by the operating system based solely on thefile extension.

In accordance with this aspect of the present invention, clicking on anicon representing a media object initiates an internet link to the MGAsite associated with the object. The MGA site responds with data that isused to customize the menu of options presented to the user inconnection with that particular object.

Consider an icon representing a JPG image file. Right-clicking on theicon may yield a menu that gives the user various options presented bythe operating system (e.g., delete, compress, rename), and additionaloptions customized in accordance with data from the object's MGA site.These customized options may include, e.g.,

-   -   (a) open in 100×150 pixel format for free;    -   (b) open in 480×640 pixel format for ten cents;    -   (c) open in 960×1280 pixel format for twenty cents;    -   (d) purchase rights to use this image in a newsletter having a        circulation of under 1000 for $1.25;    -   (e) display a complete listing of license options.

Clicking on options (b) or (c) initiates a commerce application throughwhich funds are electronically transferred to the MGA site (by theabove-described tokens or otherwise). In response, the MGA site responds(e.g., with TCP/IP or HTML instructions) authorizing an application onthe user's computer to open the file in the requested manner. (Thedefault application for JPG applications can then automatically belaunched, or the computer may first query the user whether anotherapplication should be used instead.)

Clicking on option (d) proceeds as above, and permits full use of theimage on the computer. Moreover, the MGA site sends a digitalcertificate to the user's computer memorializing the usage rightspurchased by the consumer.

In this particular arrangement, no access control is placed on thecontent, e.g., by encryption, secure container technology, or the like.The nominal fees, and the ease of licensing, make it simple for the userto “do the right thing” and avoid copyright liability. In otherembodiments, of course, known access control techniques can be used tolimit use of the object until the requisite payment has been made.

Naturally, records of all such transactions are also logged at the MGAsite.

Clicking on option (e) opens a browser window on the user's computer toa web site that presents a complete listing of license options availablefor that image. (The address of this web site is included incustomization data relayed to the user device from the MGA site, but notexplicitly shown to the user on the menu.) Through such web site, theuser can select desired rights, effect payment, and receive thenecessary authorization for software applications on the user's computer(or other media appliance) to open and/or process the content.

The object on which the user “clicks” needn't be an icon. It can be animage or other graphical representation. (And a “click” isn't necessary;a voice command or other signal may be used to the same effect with anaudio clip or selection.)

Consider the popular merchandising of books and CDs over the internet. AJPG or other image file depicting the cover of a book, or the artwork ofa CD cover, can be treated as a media object, and can include awatermarked MGA pointer. Right-clicking on such an image of a book covercould, through the MGA site, present to the user a menu of options thatincludes—in addition to those normally presented in conjunction with aJPG file—the following:

-   -   (a) “See the review of this book published in the New York Times        on Apr. 19, 1999”    -   (b) “See the list of reviews of this book at Amazon.com”    -   (c) “Enter your own review of this book, for posting on        Amazon.com”    -   (d) “See today's sales rank of this book at Amazon.com”    -   (e) “Purchase this book from Amazon.com for $16.95”    -   (f) “Purchase this book from Barnesandnoble.com for $19.95 and        receive a $5.00 credit towards your next purchase”    -   (g) “Link to the web site that tells about the release of this        title as a motion picture (presently scheduled to open on Oct.        10, 1999)”    -   (h) “Link to the Yahoo listing of web sites relating to this        book”    -   (i) “Search Lycos for listings relating to this book.”

If the user selects one of the purchase options from the menu, apre-stored e-commerce profile—containing the user name, credit cardnumber, billing address, ship-to address, etc., possibly in the form ofan encrypted object—could be sent to the MGA site (or to the bookseller)to effect the purchase, or such selection could initiate display ofadditional screens or sub-menus through which the user would manuallyenter or select such information for transmission.

Others of the selections cause a new browser window to open on theuser's computer, opening to a URL specified in data relayed from the MGAsite but not displayed to the user in the menu. Appropriate HTMLinstructions can be generated to effect a particular query or otheroperation at the specified URL.

In some embodiments, the customized menu presents only a single choicein addition to those normally provided by the operating system, e.g.,“Link to home.” Clicking on this option opens a browser window to a homepage at the MGA for that object. On that page, the user is presentedwith all of the foregoing options, and more (possibly includingadvertising graphics or multi-media). Such objects can serve as powerfulmarketing agents. Returning to the example discussed above, a JPG imagefile of a book cover may have, as its MGA, a web page hosted by aparticular bookseller, providing purchase options and other informationfor that book. Marketing of books (or CDs, or cars, or consumerappliances, or virtually anything else) can be effected by disseminatingsuch vendor-issued JPGs as widely as possible. Some book cover JPGs maybe distributed by Amazon.com, others by Barnes&Noble.com, others byBorders.com—each pointing back to a different MGA through which purchasetransactions for that book may be performed.

Returning to the MGA-customized menus, these needn't be limited to menusresulting from clicking on an icon or image (or signaling during anaudio excerpt). Drop-down menus in application programs can likewise bepopulated with customized options, in accordance with customization dataobtained from the MGA site for the object presently being accessed orused. Most graphical operating systems and application programs havewell developed toolsets permitting such menu customization. Again, otherdata relayed from the MGA site is not shown to the user, but is employedby the computer (e.g., a browser program) to carry out menu optionsselected by the user.

Again the foregoing techniques are equally applicable for still images,audio, video, and other forms of content, and can readily be adapted foruse both with general purpose computers, software applications, andspecialized media appliances.

While, for expository convenience, the foregoing discussion contemplatedembedding a literal URL address in the object as the MGA, more typicallythis is not the case. Instead, the MGA more commonly comprisesidentification data for the object (e.g. a 128-bit random ID), togetherwith the URL for a name server computer that serves many (perhapsmillions) of such objects (an example of the latter is the DigimarcMarcCentre server).

To obtain the desired data as detailed above, the user's computer(sometimes termed a client computer) links to the name server computerand provides the ID of the object being processed. The name servercomputer uses this ID to query a database, and obtains from the databasethe current IP address to which such queries should be routed. The nameserver computer can relay the request from the client computer to thecorrect destination address, or can return the correct destinationaddress to the client computer, which can initiate such a link itself.By such arrangement, the IP address ultimately associated with an objectcan be easily changed as needed, simply by changing the correspondingrecord in the name server database, without rendering obsolete legacyobjects having out-of-date addresses encoded therein.

In some embodiments, the URL of the name server needn't be included inthe watermark. In the absence of a specified URL, the client computermay direct such links to a default name server address instead (storedlocally or remotely). If that server doesn't recognize the object ID, itcan return an error code, or pass the query on to other name servers.Those servers, in turn, can pass the query along to still other nameservers if they don't recognize the object ID. In this fashion, anexponentially-large number of name servers might be quickly polled forinformation relating to the identified object. Alternatively, ratherthan encoding the complete IP address of the name server in an objectwatermark, the first N (e.g., 16) bits of the object ID might be used asa short-hand for one of 65,536 predetermined name server addresses, inaccordance with data stored locally (e.g., on RAM or disk in the user'scomputer) or remotely (e.g., at a default name server IP address).

While the basic concept idea behind embedding MGA data within an objectis to point to a repository of data about the object, a pointer theother way may be achieved as well.

As noted, the “ping” application of MGA data permits an MGA site to beinformed of sites through which its object passes. More generally, theMGA site can log the originating address of each query it receives. Eachsuch address can be presumed to have (or have had) a copy of thecorresponding object. Media owners can thereby track the disseminationof copies of their media objects—at least insofar as use of such objectsentails communicating with the associated MGA site.

Such tracking offers a great number of opportunities, some in the areaof commerce. The MGA site corresponding to the cover art of a GarthBrooks CD, for example, can provide a listing of IP addresses of personsinterested in that CD. Email or promotional data objects (e.g., audioclips) can be sent to that list of addresses when a subsequent GarthBrooks CD is released.

Such tracking also opens up a new dimension of internet searching.Presently, internet search engines use a brute force approach, visitingmillions of pages across the web in order to identify, for example, adozen instances of a given photograph file. MGAs offer a shortcut tosuch brute force approaches. With the present technology, a searchengine can find a single instance of a photograph file and, by detectionof the MGA data watermarked therein, link to the corresponding MGA site.From the MGA site, the search engine can obtain a listing (if suchqueries are authorized) of some or all of the other sites known by theMGA site to have copies of that photograph file. (Providing such data tosearch engines is a commerce opportunity for such MGA sites, which maypermit such access to its listing of sites only in exchange for a fee.Or the MGA site may arrange to collect a tribute payment from the searchengine proprietor each time the engine responds to a user query usingdata collected from the MGA site.)

Many of the addresses logged by the MGA may not be publicly-accessibledata stores. The search engine can check each listed address to ensurethat the desired object is present and accessible before adding theaddress to its database.

Covert Tracing

Application Ser. No. 09/185,380 (now U.S. Pat. No. 6,549,638) describesanti-counterfeiting technology that looks for the presence of digitaldata corresponding to bank note imagery in a computer system, and makesa covert record of any attempt to process such data (e.g., Scan,FileOpen, FileSave, Print, Edit, etc.). Such records are hidden from theuser of the system (using, e.g., various data encryption and obscuringtechniques), but authorized law enforcement officials are provided toolsby which these records can be recovered. The forensic data therebyobtained may prove useful in prosecuting counterfeiters. (Knowledge thata computer may be covertly storing evidence of attempted counterfeitingactions may prove as, or more, valuable in deterring counterfeiting thanthe covert records themselves)

The same techniques can be employed to deter unauthorized processing ofaudio, image, video, or content by media pirates. In one embodiment, acomputer's operating system (including peripheral device drivers)monitors various data within the system (e.g., data sent to writeablestorage media, or sent via a serial port or network connection, etc.)for data bearing a do-not-copy watermark. The presence of such databeing sent, e.g., to a writeable disk or to a remote computer, indicatesthat the do-not-copy instruction has been circumvented. In such case,the operating system writes one or more covert records memorializing theactivity, for possible use in criminal prosecution if the computer islawfully seized.

The example just-provided is but one of many monitoring and responsetechniques that may be employed to deter circumvention ofcopy-protection or other access control systems. Generally speaking, ifcontent data is found where it shouldn't be, or is found used as itshouldn't be used, a corresponding record should be made. (Otherintervention actions can be triggered as well; covert tracing isdesirably just one of several parallel responses to suspected hacking.)

Meta-Data Accessed Using Watermarks

Meta-data, in formats known as XML, SGML, and HTML, is widely used tocommunicate information about digital objects (e.g., author, keywords,price, rights, caption, etc.). More generally, meta-data can be thoughtof as any data construct which associates the name of a property (e.g.,“author), with the value of the property (e.g., “Mark Twain”). Such datacommonly appears in a tag format, such as the following:

-   -   <META NAME=“author” CONTENT=“Mark Twain”>

Meta-data is commonly exchanged between server and client computers inconjunction with the digital objects to which they relate (e.g., thetext of a Mark Twain book).

As detailed herein, an important application of watermarking is likewiseto convey information about media—in this case embedded within the mediacontent itself (e.g., providing unique identification, establishing somebasic behaviors such as do not copy, and providing links to extendedfunctionality).

For meta-data to be useful, it must be linked to associated content,whether in the context of a browser, application program, operatingsystem, asset management system, search engine, etc. However, asdetailed below, the content and the associated meta-tags needn't alwaysbe conveyed together.

Consider an application program or other client process that receives awatermarked media object. The watermark includes an MGA for that object(which, as noted above, may not specify an ultimate IP address). Storedat the MGA site is meta-data corresponding to the object. By linking tothe MGA site identified by the object's watermark, the client computercan obtain the meta-data corresponding to the object. This data can bestored at the client computer and used just as any other meta-data,e.g., to define the local functions that should be available for usewith that object (e.g., buy, search, etc.)

A particular example is an on-line catalog of stock photography. Eachphotograph is watermarked with MGA data. To identify the photographer,copyright date, price, telephone number, subject, etc., an applicationprogram can link to the MGA site for that photograph, and obtain thecorresponding meta-data. This data can then be displayed or used asneeded. Data objects of disparate formats thus can readily be handledwithin a single, simple application program, since the program needn'tconcern itself with the varying formats for the associated meta-data(assuming the name servers provide this data in standardized format).Substantial flexibility in programming and object formatting is therebyachieved.

Returning to the internet search engine example described above, MGAsmay become recognized as repositories rich in meta-data for mediaobjects. Specialized search engines may focus their data collectionaround such sites, and be able to quickly identify the MGA sitescorresponding to various boolean combinations of meta-tag parameters.

Asset Management/Containers

Much has been written on the topic of asset rights management. Samplepatent documents include U.S. Pat. Nos. 5,892,900, 5,715,403, 5,638,443,5,634,012, 5,629,980 and laid-open European application EP 862,318. Muchof the technical work is memorialized in journal articles, which can beidentified by searching for relevant company names and trademarks suchas IBM's Cryptolope system, Portland Software's ZipLock system, theRights Exchange service by Softbank Net Solutions, and the DigiBoxsystem from InterTrust Technologies.

An exemplary asset management system makes content available (e.g. froma web server, or on a new computer's hard disk) in encrypted form.Associated with the encrypted content is data identifying the content (eg. a preview) and data specifying various rights associated with thecontent. If a user wants to make fuller use of the content, the userprovides a charge authorization (e.g. a credit card) to the distributor,who then provides a decryption key, allowing access to the content.(Such systems are often realized using object-based technology. In suchsystems, the content is commonly said to be distributed in a “securecontainer.”)

Desirably, the content should be marked (personalized/serialized) sothat any illicit use of the content (after decryption) can be tracked.This marking can be performed with watermarking, which assures that themark travels with the content wherever—and in whatever form—it may go.The watermarking can be effected by the distributor—prior todissemination of the encrypted object—such as by encoding a UID that isassociated in a database with that particular container. When accessrights are granted to that container, the database record can be updatedto reflect the purchaser, the purchase date, the rights granted, etc. Analternative is to include a watermark encoder in the software tool usedto access (e.g. decrypt) the content. Such an encoder can embedwatermark data in the content as it is released from the securecontainer, before it is provided to the user. The embedded data caninclude a UID. This UID can be assigned by the distributor prior todisseminating the container. Alternatively, the UID can be a data stringnot known or created until access rights have been granted. In additionto the UID, the watermark can include other data not known to thedistributor, e.g. information specific to the time(s) and manner(s) ofaccessing the content.

As noted earlier, access rights systems can be realized with watermarkswithout containers etc. For example, in a trusting world, copyrightedworks can be freely available on the web. If a user wishes to makelawful use of the work, the user can decode its watermark to determinethe work's terms and conditions of use. This may entail linking to a website specified by the embedded watermark (directly, or through anintermediate database), which specifies the desired information. Theuser can then arrange the necessary payment, and use the item knowingthat the necessary rights have been secured.

Remote Reconfiguration of Watermark Detectors

In some cases, it is desirable to reconfigure watermark detectorsremotely. Such functionality is desirable, for example, if a watermarksystem is hacked or otherwise compromised.

In accordance with this aspect of the present invention, some aspect ofa watermark detector's operation is changed in response to a command.The change can take various forms. In watermark systems employingpseudo-random key data (e.g., spread spectrum spreading signals), thepseudo-random signal used for detection can be changed. In systems usingDFT processing, the mapping between message bits and DFT coefficientscan be changed. In still other systems, the decoding can proceed asbefore, but the significance of one or more bits can be changed (e.g.,bits that were normally interpreted as defining Field A can beinterpreted as defining Field B, and vice versa). In yet other systems,the decoding can proceed as before, but the response of a device to agiven watermark signal can be changed. In still other systems, a set ofsoftware instructions can be re-written or re-ordered to effect a changein detector operation.

The command can be conveyed in various ways. In one embodiment, it canbe a trigger bit in the watermark payload. Normally the bit has a valueof “0.” If the bit has a value of “1,” the detector system responds bychanging its operation. A trigger pattern can also be established, sothat detection of a certain combination of bits in the watermark payloadserves to trigger the change. Reserved states of certain data fields areexamples of patterns that might be employed.

The command can also be conveyed through another channel different thanthe watermark channel (e.g., an SCA channel of an FM broadcast, or thesub-titling data channel of video broadcasts, or header data within anMPEG data stream, etc., etc.).

The change can proceed in accordance with a pre-programmed rule (e.g.,codes progressing successively through a numerically oralgorithmically-determined progression), or the change can proceed inaccordance with data specified elsewhere in the payload of the watermarkbearing the trigger bit (e.g., instead of being interpreted in normalfashion, the non-trigger bits of the detected watermark can define a newpseudo-random key data. Or the change can proceed in accordance withdata conveyed in successively-presented watermark payloads, as might bedone in video encoding where each frame of video can convey furtherwatermark information. (This latter arrangement is one offering ahigh-bandwidth reprogramming channel through which, e.g., extensivefirmware instructions might be transferred to the detector to replaceinstructions earlier stored.)

By such arrangements, greatly increased detector versatility andfunctionality can be achieved.

Conclusion

Many diverse embodiments are reviewed above—each with a unique set offeatures. (Still others are disclosed in the assignee's patentsincorporated by reference.) This specification should be construed asexplicitly teaching that features illustrated in one such embodiment cangenerally be used in other embodiments as well. Thus, for example, adate field was not particularly discussed in connection with payloaddata for video watermarking. Nor were “play once” watermarksso-considered. The inclusion of a calibration signal with (or as partof) the watermark is shown in embodiments of the issued patents, but isnot belabored in the above-described embodiments. Likewise with “simpleuniversal codes.” The pre-stored commerce profile described in one ofthe foregoing embodiments is equally applicable to other embodiments aswell. Likewise, the presentation of advertising was discussed inconnection with one embodiment but not others, although it, too, isgenerally applicable. All of these concepts are familiar at Digimarc andare regarded as generally applicable throughout the work expressed inDigimarc's patent disclosures. Practicality prevents an exhaustiverecitation of each individual permutation and combination.

Having described and illustrated the principles of our invention withreference to illustrative embodiments, it will be apparent that thedetailed arrangements can be modified in arrangement and detail withoutdeparting from such principles.

For example, while reference has been made to various uses of wireless,it should be understood that such reference does not just cover FMbroadcast, and wireless internet networking and the like, but alsoincludes other wireless mechanisms. Examples include cell phones anddirect satellite broadcast.

Likewise, while certain embodiments were illustrated with a watermarkpayload of 100+bits, in other systems much smaller (or sometimes larger)payloads are desirable—sometimes as small as 1-8 bits.

While the foregoing examples have each been illustrated with referenceto a particular media type (e.g., video, audio, etc.), it will berecognized that the principles of each embodiment find application withthe other media types as well.

Certain of the appliances contemplated above require user interfacesmore sophisticated than are presently typical on such devices. Thesimplicity of the underlying audio appliance can be preserved, in manyinstances, by using a palmtop computer—coupled by infrared orotherwise—as a temporary user interface to the appliance. Some of theprocessing capability can likewise be off-loaded to an ancillarypalmtop. (Palmtop is here meant to refer generally to any pocket-sizeprogrammable computing device.)

Unless otherwise stated, it should be understood that the digital music,video, and imagery contemplated herein is not of any particular form orformat. Audio, for example, can be of various forms, both streaming andnon-streaming, and of various formats (e.g. MP3, MP4, MS Audio, WindowsMedia Technologies, RealAudio, *.WAV, MIDI, Csound, Dolby's AdvancedAudio Codec (AAC), etc.

To provide a comprehensive disclosure without unduly lengthening thepresent specification, applicants incorporate by reference the patentpublications and applications cited herein.

We claim as our invention all such embodiments as may come within thescope and spirit of the following claims, and equivalents thereto.

APPENDIX B Application 09/679,261 Peripheral Device for a ComputerSystem Description

The present disclosure memorializes certain improvements to the subjectmatter detailed in pending application Ser. No. 09/343,104 (Jun. 29,1999), and Ser. No. 09/292,569 (Apr. 15, 1999), the disclosures of whichare incorporated by reference.

The cited '104 application details a variety of systems in which objectsinteract with computer devices. The objects can be physical objects,marked with machine-readable indicia, such as digital watermarks.Optical input devices, such as webcams, are used to capture image datafrom the object, so that the computer device can recognize the objectand respond accordingly.

In the '104 application, the disclosed technology was referred to by thename “Bedoop.” The present assignee now markets such technology underthe Digimarc MediaBridge name. The former term is used in thisdisclosure.

One form of optical input device usable in such systems is a mouse-likeperipheral that includes an optical sensing system. The optical sensingsystem can comprise a 1D array of plural optical sensors (e.g., CCD,CMOS, etc.), or a 2D array. Such devices are already known in othercontexts, e.g., the Microsoft IntelliMouse with IntelliEye technology.That device includes a multi-element CMOS optical sensor integrated onan IC with various detector and processing circuitry, operating inconjunction with a short focal length imaging lens and an LEDillumination source. The circuitry tracks movement of patterns acrossthe sensor's field of view, by which the mouse's movement can bededuced. The Microsoft product collects 1500 data sets per second—aframe rate much higher than is needed in most embodiments of theassignee's Bedoop technology.

Such a mouse-like peripheral can omit the buttons and position-sensingfeatures commonly provided on traditional mice, yielding a simpledesk-facing palm camera that generates frames of data corresponding to asmall area under the sensor portion of the mouse. More typically,however, the peripheral includes the buttons, roller wheels, and/orX-/Y-position sensing arrangements of traditional mice so that buttonand positional forms of data input can be exploited in interacting withthe Bedoop application.

The optical data collected by the sensor can be processed within theperipheral's processor to extract the steganographically encoded binaryBedoop data therefrom. Or this processing burden can be undertaken bythe associated computer system, with the peripheral simply processingand formatting the raw sensor data into sequential frames of image datato be output to that system.

Any form of hand-held scanner—whether of the type just described orothers known in the art—offers a convenient way to interact with catalogadvertising. Imagine a traditional paper catalog, e.g., from L:L. Bean,Inc., or Lands End. Each image in the catalog is Bedoop-encoded with acode that identifies the depicted product. A user browsing through thecatalog, on seeing a product of interest, places the scanner over thepicture (and optionally may be required to push a button or otherwisesignal to initiate further processing). The scanner detects the Bedoopdata and relays it to an associated computer (optionally with dataidentifying the consumer). The computer polls a remote server computermaintained by the merchant, which responds with data corresponding tothe item depicted in the scanned image. This returned data can includedata indicating the sizes available, data indicating the colorsavailable, data indicating the variant styles available, flag bitsindicating whether each item is in stock, etc. This returned data can bepresented to the consumer—typically on a display device butalternatively in audible form.

Preferably, the customer's body measurements (waist size, inseam length,neck size, etc.) are stored in a user profile, either on the localcomputer or at the merchant's server computer. This allows the system tocustomize the data presented to the user—e.g., showing the color optionsand availability only for the depicted shirt in a 16 inch neck and a 34inch sleeve length.

If necessary, the user can select among the color or style options,using the handheld input device (either buttons, gestures, etc.), or anyother input device. Or the item may be one for which no furtherspecifications are needed. In either event, once the desired product hasbeen sufficiently specified, the user can signal the system to place theorder. Payment and shipping details can be arranged through any of thegreat variety of techniques known in the art, e.g., by charging to acredit card number and shipping to an address on-file with the merchant.

While scanning peripherals of the type described above are typicallywired to an associated host system, wireless links (e.g., radio,infrared, ultrasonic, etc.) can of course be used, freeing the user fromthe constraint imposed by the cable.

Another use of the technology detailed in the '104 application (andother applications and patents of the present assignee, including U.S.Pat. No. 5,841,886—incorporated herein by reference) is to controlbuilding access (or facility access, or room access, etc.) through acombination of an ID card, Bedoop technology, and proximity detectiontechnology.

The ID card can be a badge or the like having asteganographically-encoded photograph of the bearer. The card furtherincludes a proximity ID device, such as an unpowered electronic circuitthat is excited and detected by a radiant field from an associatedproximity detector, providing a unique signature signal identifying aparticular individual.

The building can be provided with an image sensor (such as a videocamera or the like), an associated Bedoop detection system, and theproximity detector. When a user wearing the badge approaches, theproximity detector signals the camera to capture image data. The Bedoopdetection system identifies the badge photograph (e.g., by clues as aredescribed in the prior applications, or without such aids), capturesoptical data, and decodes same to extract thesteganographically-embedded data hidden therein. The access controlsystem then checks whether the badge ID discerned from the proximitysensor properly corresponds to the Bedoop data extracted from thephotograph on the badge. If so, access is granted; if not, the data islogged and an alarm is sounded.

By such arrangement, premises security is increased. No longer canproximity-based access badges be altered to substitute the picture of adifferent individual. If the photo is swapped, the proximity system IDand the embedded photo data will not match, flagging an unauthorizedattempted access.

The same principles are applicable in many other contexts—not limited toRF-based proximity detection systems. For example, the data decoded fromthe photograph can be compared against other forms of machine-sensedpersonal identification associated with the badge. These include, butare not limited to, bar code IDs, mag-stripe ID cards, smart cards, etc.Or the comparison can be with an identification metric not associatedwith the badge (e.g., retinal scan, voice print, or other biometricdata).

Having described an illustrated the principles of our inventions withreference to specific embodiments, it will be recognized that theprinciples thereof can be implemented in many other, different, forms.Moreover, the particular combinations of elements and features in theabove-detailed embodiments are exemplary only; the interchanging andsubstituting of these teachings with teachings in theincorporated-by-reference applications and patent are also contemplated.

1. In a mouse having X- and Y-position encoders, and circuitryresponsive thereto for generating X- and Y-movement data and providingsame to an associated computer, an improvement comprising an opticalsensor disposed in said mouse, circuitry responsive thereto forproducing grey-scale optical image data and for providing same to theassociated computer, and a semiconductor substrate, the substrate havingformed thereon both the optical sensor and said circuitry, wherein themouse serves both as a positioning device and as an optical inputdevice, and wherein said substrate also has formed thereon a secondcircuit, for decoding plural bit auxiliary data steganographicallyencoded within said optical image data, and for providing data relatedthereto to said associated computer.
 2. A method of using an opticalmouse, the mouse including an optical sensor with several elements, andcircuitry coupled thereto for producing grey scale image data, themethod comprising: positioning the mouse over a substrate, the substratehaving user-perceptible printing thereon, said printing havingplural-bit auxiliary data steganographically encoded therein; capturingimage data corresponding to said printing; and from said image data,decoding the plural-bit auxiliary data steganographically encoded insaid printing.
 3. The method of claim 2 in which the substrate includesplural regions, each encoded with a different set of auxiliary data, anda desired set of auxiliary data is decoded by positioning the mouse overa corresponding region.
 4. The method of claim 2 in which the substrateis a page in a bound collection of pages, and the method includespositioning the mouse over said page.
 5. The method of claim 4 in whichthe bound collection of pages is a magazine.
 6. The method of claim 4 inwhich the bound collection of pages is a book.
 7. The method of claim 2in which the substrate is a mouse pad.
 8. The method of claim 2 in whichthe mouse is coupled to an associated computer device, and the methodincludes decoding the plural-bit auxiliary data using a processor insaid computer device.
 9. The method of claim 2 that includes directing aweb browser to a web address identified in accordance with theplural-bit auxiliary data decoded from the image data.
 10. A method ofusing an optical mouse, the mouse including an optical sensor withseveral elements, and circuitry coupled thereto for producing grey scaleimage data, the method comprising: positioning the mouse over a texturedsubstrate, the texturing having plural-bit auxiliary datasteganographically encoded therein; capturing image data correspondingto said texturing; and from said image data, decoding the plural-bitauxiliary data steganographically encoded in said texturing.
 11. Aperipheral device for use with a computer system comprising: a housingadapted to fit within a user's palm and slide over a medium; an opticalsensor having plural sensing elements and producing image signals; alens for imaging the medium onto the sensor; circuitry coupled to thesensor and disposed within the housing for processing the signals fromthe sensor and producing corresponding output data; and a wireless linkfor relaying the output data from the peripheral device to the computersystem; wherein said sensor is useful in acquiring optically-encodedmulti-bit information from said medium for use by said computer system.12. The device of claim 11 in which the circuitry within said housinganalyzes the image signals and produces multi-bit informationcorresponding thereto.
 13. The device of claim 11 in which the circuitrycomprises a decoder for discerning steganographically-encodedinformation represented in said image signals.
 14. The device of claim11 in which the optically encoded information comprises a plural-bitidentifier.
 15. A system including a device and a computer apparatuscomprising: a housing adapted to fit within a user's palm and positionover a medium; an optical sensor having plural sensing elements andproducing image signals; a lens for imaging the medium onto the sensor;circuitry coupled to the sensor and disposed within the housing forprocessing signals from the sensor corresponding to a machine-readableindicia, and producing corresponding multi-bit binary output datadecoded from said indicia; and transfer means for relaying the outputdata from the device to the computer apparatus; wherein said circuitryis integrated on a common substrate with said sensing elements.
 16. Thesystem of claim 15 in which the machine-readable indicia comprisesartwork that is steganographically encoded to convey multi-bit binarydata.
 17. The system of claim 16 in which the artwork comprises aphotograph.
 18. The system of claim 16 in which: the device includes auser interface control; and the circuitry performs steganographicdecoding only when said control is activated by a user.
 19. The systemof claim 15 in which the multi-bit data is associated with an internetaddress, and the system includes a display for presenting image datadownloaded from said internet address.
 20. A method of interacting withprinted material using a peripheral device, the peripheral deviceproviding positional data to an associated computer and including anoptical sensing system comprising plural optical sensing elements, themethod comprising: positioning the device over the printed material;generating optical sensor data from said optical sensing system, saiddata corresponding to a machine-readable indicia formed on the printedmaterial; performing a steganographic decoding process on said opticalsensor data to produce plural-bit data corresponding to saidmachine-readable indicia; and providing said plural-bit data to saidcomputer.
 21. The method of claim 20 in which the machine-readableindicia is hidden in artwork.
 22. The method of claim 21 in which theartwork comprises a photograph.
 23. The method of claim 20 in which theplural-bit data is associated with an internet address, and the methodfurther includes downloading data to said computer from said internetaddress.
 24. The method of claim 20 in which the peripheral deviceincludes a user interface control, and the method includes performingsaid steganographic decoding process only when said control is activatedby a user.
 25. The method of claim 24 in which the control is a button.26. The method of claim 20 in which said steganographic decoding isperformed by apparatus within the peripheral.
 27. The method of claim 26in which plural bit data from said decoding is transmitted, by wirelesstransmission, to an apparatus remote from the peripheral.
 28. The methodof claim 20 in which at least part of said optical sensor data istransferred from the peripheral, and said steganographic decoding isperformed on said data in apparatus remote from said peripheral.
 29. Themethod of claim 28 in which said optical sensor data is transferred fromthe peripheral to said remote apparatus by wireless transmission.
 30. Adevice including a substrate having formed thereon both a 2D array ofoptical sensing elements for producing sensor data, and a steganographicdecoder coupled to said elements to effect a steganographic decodingoperation on said sensor data.